AN INTRODUCTION TO THE STANDARD MODEL OF
PARTICLE PHYSICS

Second Edition 


The Standard Model of particle physics is the mathematical theory that describes 
the weak, electromagnetic and strong interactions between leptons and quarks, the 
basic particles of the Standard Model. 

The new edition of this introductory graduate textbook provides a concise but 
accessible introduction to the Standard Model. It has been updated to account for 
the successes of the theory of strong interactions, and the observations on matter-antimatter
asymmetry. It has become clear that neutrinos are not massless, and this
book gives a coherent presentation of the phenomena and the theory that describes 
them. It includes an account of progress in the theory of strong interactions and of 
advances in neutrino physics. The book clearly develops the theoretical concepts 
from the electromagnetic and weak interactions of leptons and quarks to the strong 
interactions of quarks. 

This textbook provides an up-to-date introduction to the Standard Model for 
graduate students in particle physics. Each chapter ends with problems, and hints to 
selected problems are provided at the end of the book. The mathematical treatments 
are suitable for graduates in physics, and more sophisticated mathematical ideas 
are developed in the text and appendices. 

Noel Cottingham and Derek Greenwood are theoreticians working in the
H. H. Wills Physics Laboratory at the University of Bristol. They have published two 
undergraduate texts with Cambridge University Press, Electricity and Magnetism 
Preface to the second edition


In the eight years since the first edition, the Standard Model has not been seriously 
discredited as a description of particle physics in the energy region (<2 TeV) so 
far explored. The principal discovery in particle physics since the first edition is 
that neutrinos carry mass. In this new edition we have added chapters that extend 
the formalism of the Standard Model to include neutrino fields with mass, and we 
consider also the possibility that neutrinos are Majorana particles rather than Dirac 
particles. 

The Large Hadron Collider (LHC) is now under construction at CERN. It is 
expected that, at the energies that will become available for experiments at the 
LHC (~20 TeV), the physics of the Higgs field will be elucidated, and we shall 
begin to see ‘physics beyond the Standard Model’. Data from the ‘B factories’ will 
continue to accumulate and give greater understanding of CP violation. We are 
confident that interest in the Standard Model will be maintained for some time into 
the future. 

Cambridge University Press have again been most helpful. We thank Miss V. K. 
Johnson for secretarial assistance. We are grateful to Professor Dr J. G. Korner 
for his corrections to the first edition, and to Professor C. Davies for her helpful 
correspondence. 
Preface to the first edition


The ‘Standard Model’ of particle physics is the result of an immense experimental 
and inspired theoretical effort, spanning more than fifty years. This book is intended 
as a concise but accessible introduction to the elegant theoretical edifice of the 
Standard Model. With the planned construction of the Large Hadron Collider at 
CERN now agreed, the Standard Model will continue to be a vital and active subject. 

The beauty and basic simplicity of the theory can be appreciated at a certain 
‘classical’ level, treating the boson fields as true classical fields and the fermion 
fields as completely anticommuting. To make contact with experiment the theory 
must be quantised. Many of the calculations of the consequences of the theory are 
made in quantum perturbation theory. Those we present are for the most part to the 
lowest order of perturbation theory only, and do not have to be renormalised. Our 
account of renormalisation in Chapter 8 is descriptive, as is also our final Chapter 19 
on the anomalies that are generated upon quantisation. 

A full appreciation of the success and significance of the Standard Model requires 
an intimate knowledge of particle physics that goes far beyond what is usually taught 
in undergraduate courses, and cannot be conveyed in a short introduction. However, 
we attempt to give an overview of the intellectual achievement represented by the 
Model, and something of the excitement of its successes. In Chapter 1 we give a 
brief resume of the physics of particles as it is qualitatively understood today. Later 
chapters developing the theory are interspersed with chapters on the experimental 
data. The amount of supporting data is immense and so we attempt to focus only on 
the most salient experimental results. Unless otherwise referenced, experimental 
values quoted are those recommended by the Particle Data Group (1996). 

The mathematical background assumed is that usually acquired during an undergraduate
physics course. In particular, a facility with the manipulations of matrix
algebra is very necessary; Appendix A provides an aide-memoire. Principles of
symmetry play an important role in the construction of the model, and Appendix B
is a self-contained account of the group theoretic ideas we use in describing these
symmetries. The mathematics we require is not technically difficult, but the reader
must accept a gradually more abstract formulation of physical theory than that presented
at undergraduate level. Detailed derivations that would impair the flow of
the text are often set as problems (and outline solutions to these are provided).

The book is based on lectures given to beginning graduate students at the University
of Bristol, and is intended for use at this level and, perhaps, in part at least,
at senior undergraduate level. It is not intended only for the dedicated particle 
physicist: we hope it may be read by physicists working in other fields who are 
interested in the present understanding of the ultimate constituents of matter. 

We should like to thank the anonymous referees of Cambridge University Press 
for their useful comments on our proposals. The Department of Physics at Bristol 
has been generous in its encouragement of our work. Many colleagues, at Bristol 
and elsewhere, have contributed to our understanding of the subject. We are grateful 
to Mrs Victoria Parry for her careful and accurate work on the typescript, without 
which this book would never have appeared. 
Notation


Position vectors in three-dimensional space are denoted by $\mathbf{r} = (x, y, z)$, or
$\mathbf{x} = (x^1, x^2, x^3)$ where $x^1 = x$, $x^2 = y$, $x^3 = z$.

A general vector $\mathbf{a}$ has components $(a^1, a^2, a^3)$, and $\hat{\mathbf{a}}$ denotes a unit vector in
the direction of $\mathbf{a}$.

Volume elements in three-dimensional space are denoted by
$d^3\mathbf{x} = dx\,dy\,dz = dx^1\,dx^2\,dx^3$.

The coordinates of an event in four-dimensional time and space are denoted by
$x = (x^0, x^1, x^2, x^3) = (x^0, \mathbf{x})$ where $x^0 = ct$.

Volume elements in four-dimensional time and space are denoted by
$d^4x = dx^0\,dx^1\,dx^2\,dx^3 = c\,dt\,d^3\mathbf{x}$.

Greek indices $\mu, \nu, \rho$ take on the values 0, 1, 2, 3.

Latin indices $i, j, k, l$ take on the space values 1, 2, 3.


Pauli matrices 


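We denote by $\sigma^\mu$ the set $(\sigma^0, \sigma^1, \sigma^2, \sigma^3)$ and by $\tilde{\sigma}^\mu$ the set
$(\sigma^0, -\sigma^1, -\sigma^2, -\sigma^3)$, where

$$\sigma^0 = I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad
\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad
\sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad
\sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix};$$

$(\sigma^1)^2 = (\sigma^2)^2 = (\sigma^3)^2 = I$; $\sigma^1\sigma^2 = i\sigma^3 = -\sigma^2\sigma^1$, etc.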




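Chiral representation for $\gamma$-matrices

$$\gamma^0 = \begin{pmatrix} 0 & I \\ I & 0 \end{pmatrix}, \quad
\gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}, \quad
\gamma^5 = i\gamma^0\gamma^1\gamma^2\gamma^3 = \begin{pmatrix} -I & 0 \\ 0 & I \end{pmatrix}.$$
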
Quantisation ($\hbar = c = 1$)

$(E, \mathbf{p}) \to (i\partial/\partial t, -i\nabla)$, or $p^\mu \to i\partial^\mu$.

For a particle carrying charge $q$ in an external electromagnetic field,
$(E, \mathbf{p}) \to (E - q\phi, \mathbf{p} - q\mathbf{A})$, or $p^\mu \to p^\mu - qA^\mu$,

$i\partial^\mu \to (i\partial^\mu - qA^\mu) = i(\partial^\mu + iqA^\mu)$.


Field definitions

$Z_\mu = W^3_\mu \cos\theta_w - B_\mu \sin\theta_w$,

$A_\mu = W^3_\mu \sin\theta_w + B_\mu \cos\theta_w$,

where $\sin^2\theta_w = 0.2315(4)$,

$g_2 \sin\theta_w = g_1 \cos\theta_w = e$, $G_F = g_2^2/(4\sqrt{2}\,M_W^2)$.

Glossary of symbols 

$\mathbf{A}$    electromagnetic vector potential    Section 4.3
$A^\mu$    electromagnetic four-vector potential
$A^{\mu\nu}$    field strength tensor    Section 11.3
$A_{FB}$    forward-backward asymmetry    Section 15.2
$a$    wave amplitude    Section 3.5
$a, a^\dagger$    boson annihilation, creation operator
$\mathbf{B}$    magnetic field
$B^\mu$    gauge field    Section 11.1
$B^{\mu\nu}$    field strength tensor    Section 11.2
$b, b^\dagger$    fermion annihilation, creation operator
$D$    isospin doublet    Section 16.6
$d, d^\dagger$    antifermion annihilation, creation operator
$d_k$ ($k = 1, 2, 3$)    down-type quark field
$\mathbf{E}$    electric field
$E$    energy
$e, e_L, e_R$    electron Dirac, two-component left-handed, right-handed field
$F^{\mu\nu}$    electromagnetic field strength tensor    Section 4.1
$f$    radiative corrections factor    Sections 15.1, 17.4
$f_{abc}$    structure constants of SU(3)    Section B.7
$G^\mu$    gluon matrix gauge field
$G^{\mu\nu}$    gluon field strength tensor
$G_F$    Fermi constant    Section 9.4
metric tensor
strong coupling constant    Section 16.1
electroweak coupling constants
Hamiltonian    Section 3.1
Higgs field
Hamiltonian density    Section 3.3
isospin operator    Sections 1.5, 16.6
electric current density    Section 4.1
total angular momentum operator
Jarlskog constant    Section 14.3
lepton number current    Section 12.4
probability current    Section 7.1
lepton current    Section 12.2
string tension    Section 17.1
Lagrangian density    Section 3.3
normalisation volume    Section 3.5
left-handed spinor transformation matrix    Section B.6
proton mass    Section D.1
mass
right-handed spinor transformation matrix    Section B.6
number operator    Section C.1
quantum operator
total field momentum
momentum
quark colour triplet
energy-momentum transfer
rotation matrix    Section B.2
spin operator
action    Section 3.1
square of centre of mass energy
unitary matrix
1 The particle physicist’s view of Nature


1.1 Introduction 

It is more than a century since the discovery by J. J. Thomson of the electron. The 
electron is still thought to be a structureless point particle, and one of the elementary 
particles of Nature. Other particles that were subsequently discovered and at first 
thought to be elementary, like the proton and the neutron, have since been found to 
have a complex structure. 

What then are the ultimate constituents of matter? How are they categorised?
How do they interact with each other? What, indeed, should we ask of a mathematical
theory of elementary particles? Since the discovery of the electron, and more
particularly in the last sixty years, there has been an immense amount of experimental
and theoretical effort to determine answers to these questions. The present
Standard Model of particle physics stems from that effort.

The Standard Model asserts that the material in the Universe is made up of 
elementary fermions interacting through fields, of which they are the sources. The 
particles associated with the interaction fields are bosons. 

Four types of interaction field, set out in Table 1.1, have been distinguished in
Nature. On the scales of particle physics, gravitational forces are insignificant. The
Standard Model excludes from consideration the gravitational field. The quanta of
the electromagnetic interaction field between electrically charged fermions are the
massless photons. The quanta of the weak interaction fields between fermions are
the charged $W^+$ and $W^-$ bosons and the neutral $Z$ boson, discovered at CERN in
1983. Since these carry mass, the weak interaction is short ranged: by the uncertainty
principle, a particle of mass $M$ can exist as part of an intermediate state for a time
$\hbar/Mc^2$, and in this time the particle can travel a distance no greater than
$\hbar c/Mc^2 = \hbar/Mc$. Since $M_W \approx 80$ GeV/$c^2$ and $M_Z \approx 90$ GeV/$c^2$, the weak
interaction has a range $\sim 10^{-3}$ fm.
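The range estimate above is simple arithmetic, and a minimal sketch of it follows. The numerical value of $\hbar c$ (about 197.327 MeV fm) is a standard constant that is not quoted in the text, and the function name is purely illustrative.

```python
# Range of an interaction carried by a massive boson: R ~ hbar / (M c).
# Assumption: hbar*c = 197.327 MeV fm (standard value, not quoted in the text).
HBAR_C_MEV_FM = 197.327


def boson_range_fm(rest_energy_gev: float) -> float:
    """Return hbar/(M c) in fm for a boson of rest energy M c^2 given in GeV."""
    rest_energy_mev = rest_energy_gev * 1000.0
    return HBAR_C_MEV_FM / rest_energy_mev


if __name__ == "__main__":
    # Approximate masses as used in the text.
    for name, rest_energy in (("W", 80.0), ("Z", 90.0)):
        print(f"{name}: range ~ {boson_range_fm(rest_energy):.2e} fm")
```

Both bosons give a range of a few times $10^{-3}$ fm, in line with the order of magnitude stated in the text.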


Table 1.1. Types of interaction field

Interaction field        Boson                       Spin
Gravitational field      ‘Gravitons’ postulated      2
Weak field               $W^+$, $W^-$, $Z$ particles      1
Electromagnetic field    Photons                     1
Strong field             ‘Gluons’ postulated         1


The quanta of the strong interaction field, the gluons, have zero mass and, like
photons, might be expected to have infinite range. However, unlike the electromagnetic
field, the gluon fields are confining, a property we shall be discussing at length
in the later chapters of this book.

The elementary fermions of the Standard Model are of two types: leptons and 
quarks. All have spin $\frac{1}{2}$ in units of $\hbar$, and in isolation would be described by
the Dirac equation, which we discuss in Chapters 5, 6 and 7. Leptons interact 
only through the electromagnetic interaction (if they are charged) and the weak 
interaction. Quarks interact through the electromagnetic and weak interactions and 
also through the strong interaction. 


1.2 The construction of the Standard Model 

Any theory of elementary particles must be consistent with special relativity. The
combination of quantum mechanics, electromagnetism and special relativity led
Dirac to the equation now universally known as the Dirac equation and, on quantising
the fields, to quantum field theory. Quantum field theory had as its first
triumph quantum electrodynamics, QED for short, which describes the interaction
of the electron with the electromagnetic field. The success of a post-1945 generation
of physicists, Feynman, Schwinger, Tomonaga, Dyson and others, in handling
the infinities that arise in the theory led to a spectacular agreement between QED
and experiment, which we describe in Chapter 8.

The Standard Model, like the QED it contains, is a theory of interacting fields. 
Our emphasis will be on the beauty and simplicity of the theory, and this can be 
understood at a certain ‘classical’ level, treating the boson fields as true classical 
fields, and the fermion fields as completely anticommuting. To make a judgement 
of the success of the model in describing the data, it is necessary to quantise the 
fields, but to keep this book concise and accessible, results beyond the lowest orders 
of perturbation theory will only be quoted. 

Table 1.2. Leptons

Lepton                       Mass (MeV/$c^2$)       Mean life (s)                       Electric charge
Electron $e^-$               0.5110                 $\infty$                            $-e$
Electron neutrino $\nu_e$    $< 3 \times 10^{-6}$                                       0
Muon $\mu^-$                 105.658                $2.197 \times 10^{-6}$              $-e$
Muon neutrino $\nu_\mu$                                                                 0
Tau $\tau^-$                 1777                   $(291.0 \pm 1.5) \times 10^{-15}$   $-e$
Tau neutrino $\nu_\tau$                                                                 0

For neutrino masses see Chapter 20.

The construction of the Standard Model has been guided by principles of symmetry.
The mathematics of symmetry is provided by group theory; groups of
particular significance in the formulation of the Model are described in Appendix B. 
The connection between symmetries and physics is deep. Noether’s theorem states, 
essentially, that for every continuous symmetry of Nature there is a corresponding
conservation law. For example, it follows from the presumed homogeneity of
space and time that the Lagrangian of a closed system is invariant under uniform 
translations of the system in space and in time. Such transformations are therefore 
symmetry operations on the system. It may be shown that they lead, respectively, 
to the laws of conservation of momentum and conservation of energy. Symmetries, 
and symmetry breaking, will play a large part in this book. 

In the following sections of this chapter, we remind the reader of some of the 
salient discoveries of particle physics that the Standard Model must incorporate. In 
Chapter 2 we begin on the mathematical formalism we shall need in the construction 
of the Standard Model. 


1.3 Leptons 

The known leptons are listed in Table 1.2. The Dirac equation for a charged massive
fermion predicts, correctly, the existence of an antiparticle of the same mass and
spin, but opposite charge, and opposite magnetic moment relative to the direction of
the spin. The Dirac equation for a neutrino $\nu$ allows the existence of an antineutrino
$\bar{\nu}$.

Of the charged leptons, only the electron $e^-$, carrying charge $-e$, and its antiparticle
$e^+$ are stable. The muon $\mu^-$ and tau $\tau^-$ and their antiparticles, the $\mu^+$ and $\tau^+$,
differ from the electron and positron only in their masses and their finite lifetimes.
They appear to be elementary particles. The experimental situation regarding small
neutrino masses has not yet been clarified. There is good experimental evidence
that the $e$, $\mu$ and $\tau$ have different neutrinos $\nu_e$, $\nu_\mu$ and $\nu_\tau$ associated with them.

It is believed to be true of all interactions that they preserve electric charge. It
seems that in its interactions a lepton can change only to another of the same type,
and a lepton and an antilepton of the same type can only be created or destroyed
together. These laws are exemplified in the decay

$$\mu^- \to \nu_\mu + e^- + \bar{\nu}_e.$$

Apart from neutrino oscillations (see Chapters 19-21), this conservation of lepton
number, antileptons being counted negatively, which holds for each separate type
of lepton, along with the conservation of electric charge, will be apparent in the
Standard Model.

Table 1.3. Properties of quarks

Quark        Electric charge ($e$)    Mass ($\times c^{-2}$)
Up u         2/3                      1.5 to 4 MeV
Down d       -1/3                     4 to 8 MeV
Charmed c    2/3                      1.15 to 1.35 GeV
Strange s    -1/3                     80 to 130 MeV
Top t        2/3                      169 to 174 GeV
Bottom b     -1/3                     4.1 to 4.4 GeV
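The bookkeeping behind these conservation laws can be made explicit. The following is a minimal sketch, not a statement of how the Standard Model enforces them: each particle is assigned its electric charge and its three lepton numbers, and a process is accepted only if every total is the same before and after. The particle labels and function names are illustrative choices.

```python
# Each entry: (electric charge in units of e, L_e, L_mu, L_tau).
# Antileptons carry lepton number -1, as in the text.
QUANTUM_NUMBERS = {
    "e-": (-1, 1, 0, 0), "e+": (1, -1, 0, 0),
    "mu-": (-1, 0, 1, 0), "mu+": (1, 0, -1, 0),
    "tau-": (-1, 0, 0, 1), "tau+": (1, 0, 0, -1),
    "nu_e": (0, 1, 0, 0), "nu_e_bar": (0, -1, 0, 0),
    "nu_mu": (0, 0, 1, 0), "nu_mu_bar": (0, 0, -1, 0),
    "nu_tau": (0, 0, 0, 1), "nu_tau_bar": (0, 0, 0, -1),
}


def totals(particles):
    """Sum charge and the three lepton numbers over a list of particle labels."""
    return tuple(sum(QUANTUM_NUMBERS[p][i] for p in particles) for i in range(4))


def conserves_charge_and_lepton_numbers(initial, final):
    """True if charge and each separate lepton number are unchanged."""
    return totals(initial) == totals(final)


if __name__ == "__main__":
    # The muon decay quoted in this section.
    print(conserves_charge_and_lepton_numbers(["mu-"], ["nu_mu", "e-", "nu_e_bar"]))
    # A hypothetical process that conserves charge but not lepton number.
    print(conserves_charge_and_lepton_numbers(["mu-"], ["e-"]))
```

The check deliberately ignores neutrino oscillations, which the text notes as the exception to separate lepton number conservation.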


1.4 Quarks and systems of quarks 

The known quarks are listed in Table 1.3. In the Standard Model, quarks, like
leptons, are spin $\frac{1}{2}$ Dirac fermions, but the electric charges they carry are $2e/3$,
$-e/3$. Quarks carry quark number, antiquarks being counted negatively. The net
quark number of an isolated system has never been observed to change. However,
the numbers of quarks of the different types or flavours are not separately conserved:
changes are possible through the weak interaction.

A difficulty with the experimental investigation of quarks is that an isolated quark 
has never been observed. Quarks are always confined in compound systems that 
extend over distances of about 1 fm. The most elementary quark systems are baryons 
which have net quark number three, and mesons which have net quark number zero. 
In particular, the proton and neutron are baryons. Mesons are essentially a quark 
and an antiquark, bound transiently by the strong interaction field. The term hadron 
is used generically for a quark system. 

The proton basically contains two up quarks and one down quark (uud), and the
neutron two down quarks and one up (udd). The proton is the only stable baryon.
The neutron is a little more massive than the proton, by about 1.3 MeV/$c^2$, and
in free space it decays to a proton through the weak interaction: $n \to p + e^- + \bar{\nu}_e$,
with a mean life of about 15 minutes.

All mesons are unstable. The lightest mesons are the $\pi$-mesons or ‘pions’. The
electrically charged $\pi^+$ and $\pi^-$ are made up of $(u\bar{d})$ and $(d\bar{u})$ pairs, respectively,
and the neutral $\pi^0$ is either $u\bar{u}$ or $d\bar{d}$, with equal probabilities; it is a coherent
superposition $(u\bar{u} - d\bar{d})/\sqrt{2}$ of the two states. The $\pi^+$ and $\pi^-$ have a mass of
139.57 MeV/$c^2$ and the $\pi^0$ is a little lighter, 134.98 MeV/$c^2$. The next lightest
meson is the $\eta$ ($\approx 547$ MeV/$c^2$), which is the combination $(u\bar{u} + d\bar{d})/\sqrt{2}$ of
quark-antiquark pairs orthogonal to the $\pi^0$, with some $s\bar{s}$ component.


1.5 Spectroscopy of systems of light quarks 

As will be discussed in Chapter 16, the masses of the u and d quarks are quite small,
of the order of a few MeV/$c^2$, closer to the electron mass than to a meson or baryon
mass. A u or d quark confined within a distance $\sim 1$ fm has, by the uncertainty
principle, a momentum $p \sim \hbar/(1\ \mathrm{fm}) \approx 200$ MeV/$c$, and hence its energy is
$E \approx pc \approx 200$ MeV, almost independent of the quark mass. All quarks have the same
strong interactions. As a consequence, the physics of light quark systems is almost
independent of the quark masses. There is an approximate $SU(2)$ isospin symmetry
(Section 16.6), which is evident in the Standard Model.
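The claim that the confined quark's energy barely depends on its mass can be checked numerically. This is a rough sketch under stated assumptions: $\hbar c = 197.327$ MeV fm is a standard constant not quoted in the text, and the illustrative quark masses of 3 MeV and 6 MeV are simply mid-range values from Table 1.3.

```python
import math

# Assumption: hbar*c = 197.327 MeV fm (standard value, not quoted in the text).
HBAR_C_MEV_FM = 197.327


def confinement_momentum_mev(size_fm: float) -> float:
    """Uncertainty-principle estimate p*c ~ hbar*c / size, in MeV."""
    return HBAR_C_MEV_FM / size_fm


def relativistic_energy_mev(pc_mev: float, mass_mev: float) -> float:
    """E = sqrt((pc)^2 + (mc^2)^2), all quantities in MeV."""
    return math.sqrt(pc_mev ** 2 + mass_mev ** 2)


if __name__ == "__main__":
    pc = confinement_momentum_mev(1.0)
    # Illustrative masses only: mid-range u and d values from Table 1.3.
    for name, mass in (("u", 3.0), ("d", 6.0)):
        print(f"{name}: E ~ {relativistic_energy_mev(pc, mass):.2f} MeV")
```

With these inputs the two energies differ by well under 1 MeV out of about 200 MeV, which is the sense in which the physics is "almost independent of the quark masses".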

The symmetry is not exact because of the different quark masses and different 
quark charges. The symmetry breaking due to quark mass differences prevails over 
the electromagnetic. In all cases where two particles differ only in that a d quark is
substituted for a u quark, the particle with the d quark is more massive. For example,
the neutron is more massive than the proton, even though the mass, $\approx 2$ MeV/$c^2$,
associated with the electrical energy of the charged proton is far greater than that 
associated with the (overall neutral) charge distribution of the neutron. We conclude 
that the d quark is heavier than the u quark. 

The evidence for the existence of quarks came first from nucleon spectroscopy.
The proton and neutron have many excited states that appear as resonances in
photon-nucleon scattering and in pion-nucleon scattering (Fig. 1.1). Hadron states
containing light quarks can be classified using the concept of isospin. The u and d
quarks are regarded as a doublet of states $|u\rangle$ and $|d\rangle$, with $I = 1/2$ and $I_3 = +1/2$,
$-1/2$, respectively. The total isospin of a baryon made up of three u or d quarks is
then $I = 3/2$ or $I = 1/2$. The isospin 3/2 states make up multiplets of four states
almost degenerate in energy but having charges $2e$ (uuu), $e$ (uud), 0 (udd), $-e$ (ddd).
The $I = 1/2$ states make up doublets, like the proton and neutron, having charges
$e$ (uud) and 0 (udd). The electric charge assignments of the quarks were made to
account for this baryon charge structure.
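The charge pattern of the multiplets follows directly from adding quark charges and $I_3$ values. The sketch below enumerates the four possible three-quark combinations of u and d; it uses exact fractions so that the totals come out as integers. The dictionary and function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations_with_replacement

# Quark charges (in units of e) and third components of isospin, as in the text.
CHARGE = {"u": Fraction(2, 3), "d": Fraction(-1, 3)}
I3 = {"u": Fraction(1, 2), "d": Fraction(-1, 2)}


def baryon_charge(quarks):
    """Total electric charge, in units of e, of a three-quark combination."""
    return sum(CHARGE[q] for q in quarks)


def baryon_i3(quarks):
    """Total third component of isospin of a three-quark combination."""
    return sum(I3[q] for q in quarks)


if __name__ == "__main__":
    for combo in combinations_with_replacement("ud", 3):
        label = "".join(combo)
        print(label, "charge:", baryon_charge(combo), "I3:", baryon_i3(combo))
```

The four combinations uuu, uud, udd, ddd carry $I_3 = 3/2, 1/2, -1/2, -3/2$, which is why they can fill an $I = 3/2$ quartet, while only uud and udd are available to an $I = 1/2$ doublet.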

Figure 1.1 The photon cross-section for hadron production by photons on protons
(dashes) and deuterons (crosses), plotted against the photon energy $E_\gamma$ (MeV). The
difference between these cross-sections is approximately the cross-section for
hadron production by photons on neutrons. (After Armstrong et al. (1972).)

Energy level diagrams of the $I = 3/2$ and $I = 1/2$ states up to excitation energies
of 1 GeV are shown in Fig. 1.2. The energy differences between states in a multiplet
are only of the order of 1 MeV and cannot be shown on the scale of the figure. The
widths $\Gamma$ of the excited states are however quite large, of the order of 100 MeV,
corresponding to mean lives $\tau = \hbar/\Gamma \sim 10^{-23}$ s. The excited states are all energetic
enough to decay through the strong interaction, as for example $\Delta^{++} \to p + \pi^+$
(Fig. 1.3).
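The conversion from a width to a mean life is a one-line calculation. The sketch below assumes the standard value $\hbar \approx 6.582 \times 10^{-22}$ MeV s, which is not quoted in the text; the function name is illustrative.

```python
# Assumption: hbar = 6.582119569e-22 MeV s (standard value, not quoted in the text).
HBAR_MEV_S = 6.582119569e-22


def mean_life_s(width_mev: float) -> float:
    """Mean life tau = hbar / Gamma, in seconds, for a width Gamma in MeV."""
    return HBAR_MEV_S / width_mev


if __name__ == "__main__":
    # A width of order 100 MeV, typical of the excited nucleon states.
    print(f"tau ~ {mean_life_s(100.0):.1e} s")
```

A width of 100 MeV gives a mean life of a few times $10^{-24}$ s, consistent with the order-of-magnitude figure of $10^{-23}$ s in the text.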
Figure 1.2 An energy-level diagram for the nucleon and its excited states. The
levels fall into two classes: isotopic doublets ($I = 1/2$) and isotopic quartets ($I = 3/2$).
The states are labelled by their total angular momenta and parities $J^P$. The
nucleon doublet N(939) is the ground state of the system, the $\Delta$(1232) is the lowest
lying quartet. Within the quark model (see text) these two states are the lowest that
can be formed with no quark orbital angular momentum ($L = 0$). The other states
designated by unbroken lines have clear interpretations: they are all the next most
simple states with $L = 1$ (negative parity) and $L = 2$ (positive parity). The broken
lines show states that have no clear interpretation within the simple three-quark
model. They are perhaps associated with excited states of the gluon fields.
Table 1.4. Isospin quantum numbers of light quarks

Quark        Isospin $I$    $I_3$
u            1/2            1/2
$\bar{u}$    1/2            -1/2
d            1/2            -1/2
$\bar{d}$    1/2            1/2
s            0              0
$\bar{s}$    0              0



Figure 1.3 A quark model diagram of the decay $\Delta^{++} \to p + \pi^+$. The gluon field
is not represented in this diagram, but it would be responsible for holding the quark
systems together and for the creation of the $d\bar{d}$ pair.


The rich spectrum of the baryon states can largely be described and understood 
on the basis of a simple ‘shell’ model of three confined quarks. The lowest states 
have orbital angular momentum L = 0 and positive parity. The states in the next 
group have L = 1 and negative parity, and so on. However, the model has the curious 
feature that, to fit the data, the states are completely symmetric in the interchange 
of any two quarks. For example, the $\Delta^{++}$ (uuu), which belongs to the lowest
$I = 3/2$ multiplet, has $J^P = 3/2^+$. If $L = 0$ the three quark spins must be aligned $\uparrow\uparrow\uparrow$
in a symmetric state to give $J = 3/2$, and the lowest energy spatial state must be
totally symmetric. Symmetry under interchange is not allowed for an assembly of
identical fermions! However, there is no doubt that the model demands symmetry, 
and with symmetry it works very well. The resolution of this problem will be left 
to later in this chapter. There are only a few states (broken lines in Fig. 1.2) that 
cannot be understood within the simple shell model. 

Figure 1.4 States of the quark-antiquark system $u\bar{u}$, $u\bar{d}$, $d\bar{u}$, $d\bar{d}$ form isotopic triplets
($I = 1$): $u\bar{d}$, $(u\bar{u} - d\bar{d})/\sqrt{2}$, $d\bar{u}$; and also isotopic singlets ($I = 0$): $(u\bar{u} + d\bar{d})/\sqrt{2}$.
Figure 1.4(a) is an energy-level diagram of the lowest energy isosinglets ($I = 0$), including
states, marked separately in the figure, which are interpreted as predominantly $s\bar{s}$ states.
Figure 1.4(b) is an energy-level diagram of the lowest energy isotriplets ($I = 1$).
Figure 1.4(c) is an energy-level diagram of the lowest energy K mesons ($I = 1/2$). The
vertical axis is mass in GeV. The K mesons are quark-antiquark systems $u\bar{s}$ and $d\bar{s}$;
they are isotopic doublets, as are their antiparticle states $s\bar{u}$ and $s\bar{d}$. Their higher
energies relative to the states in Fig. 1.4(b) are largely due to the higher mass of
the s over the u and d quarks. The large relative displacement of the $0^+$ state is a
feature with, as yet, no clear interpretation.

Mesons made up of light u and d quarks and their antiquarks also have a rich
spectrum of states that can be classified by their isospin. Antiquarks have an $I_3$ of
opposite sign to that of their corresponding quark (Table 1.4). By the rules for the
addition of isospin, quark-antiquark pairs have $I = 0$ or $I = 1$. The $I = 0$ states
are singlets with charge 0, like the $\eta$ (Fig. 1.4(a)). The $I = 1$ states make up triplets
carrying charge $+e$, 0, $-e$, which are almost degenerate in energy, like the triplet
$\pi^+$, $\pi^0$, $\pi^-$.

The spectrum of $I = 1$ states with energies up to 1.5 GeV is shown in Fig. 1.4(b).
As in the baryon case the splitting between states in the same isotopic multiplet
is only a few MeV; the widths of the excited states are like the widths of the
excited baryon states, of the order of 100 MeV. In the lowest multiplet (the pions),
the quark-antiquark pair is in an $L = 0$ state with spins coupled to zero. Hence
$J^P = 0^-$, since a fermion and antifermion have opposite relative parity (Section
6.4). In the first excited state the spins are coupled to 1 and $J^P = 1^-$. These are
the $\rho$ mesons. With $L = 1$ and spins coupled to $S = 1$ one can construct states
$2^+$, $1^+$, $0^+$, and with $L = 1$ and spins coupled to $S = 0$ a state $1^+$. All these states
can be identified in Fig. 1.4(b).
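The assignment of $J^P$ values to quark-antiquark states can be reproduced by a short enumeration. The sketch below assumes the usual rules: $J$ runs from $|L - S|$ to $L + S$, and the parity is $(-1)^{L+1}$, the extra factor of $-1$ being the opposite relative parity of fermion and antifermion mentioned in the text. The function name is illustrative.

```python
def qqbar_states(orbital_l: int, spin_s: int):
    """List the J^P labels of a quark-antiquark pair with given L and S.

    J takes the values |L - S|, ..., L + S. The parity is (-1)**(L + 1):
    (-1)**L from the orbital motion times -1 from the opposite intrinsic
    parity of a fermion and its antifermion.
    """
    parity = "+" if (orbital_l + 1) % 2 == 0 else "-"
    j_values = range(abs(orbital_l - spin_s), orbital_l + spin_s + 1)
    return [f"{j}{parity}" for j in j_values]


if __name__ == "__main__":
    for orbital_l, spin_s in ((0, 0), (0, 1), (1, 1), (1, 0)):
        print(f"L={orbital_l}, S={spin_s}:", qqbar_states(orbital_l, spin_s))
```

The four cases give the pion ($0^-$), the $\rho$ ($1^-$), the triplet of $L = 1$, $S = 1$ states ($0^+$, $1^+$, $2^+$) and the single $L = 1$, $S = 0$ state ($1^+$), matching the states listed in the text.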


1.6 More quarks 

‘Strange’ mesons and baryons were discovered in the late 1940s, soon after the 
discovery of the pions. It is apparent that as well as the u and d quarks there exists 
a so-called strange quark s, and strange particles contain one or more s quarks. An 
s quark can replace a u or d quark in any baryon or meson to make the strange 
baryons and strange mesons. The electric charges show that the s quark, like the 
d, has charge $-e/3$, and the spectra can be understood if the s is assigned isospin
$I = 0$.

The lowest mass strange mesons are the $I = 1/2$ doublet, $K^-$ ($s\bar{u}$, mass 494 MeV)
and $\bar{K}^0$ ($s\bar{d}$, mass 498 MeV). Their antiparticles make up another doublet, the $K^+$ ($u\bar{s}$)
and $K^0$ ($d\bar{s}$).

The effect of quark replacement on the meson spectrum is illustrated in
Fig. 1.4. Each level in the spectrum of Fig. 1.4(b) has a member ($d\bar{u}$) with charge $-e$.
Figure 1.4(c) shows the spectrum of strange ($s\bar{u}$) mesons. There is a correspondence
in angular momentum and parity between states in the two spectra. The energy differences
are a consequence of the s quark having a much larger mass, of the order
of 200 MeV.

The excess of mass of the s quark over the u and d quarks makes the s quark in 
any strange particle unstable to decay by the weak interaction. 

Besides the u, d and s quarks there are considerably heavier quarks: the
charmed quark c (mass $\approx 1.3$ GeV/$c^2$, charge $2e/3$), the bottom quark b (mass $\approx$
4.3 GeV/$c^2$, charge $-e/3$), and the top quark t (mass $\approx 180$ GeV/$c^2$, charge $2e/3$).
The quark masses are most remarkable, being even more disparate than the lepton 
masses. The experimental investigation of the elusive top quark is still in its infancy, 
but it seems that three quarks of any of the six known flavours can be bound to form 
a system of states of a baryon (or three antiquarks to form antibaryon states), and 
any quark-antiquark pair can bind into mesonic states. 

The c and b quarks were discovered in $e^+e^-$ colliding beam machines. Very
prominent narrow resonances were observed in the $e^+e^-$ annihilation cross-sections.
Their widths, of less than 15 MeV, distinguished the meson states responsible
from those made up of u, d or s quarks. There are two groups of resonant states.
The group at around 3 GeV centre of mass energy are known as $J/\psi$ resonances,
and are interpreted as charmonium $c\bar{c}$ states. Another group, around 10 GeV, the $\Upsilon$
(upsilon) resonances, are interpreted as bottomonium $b\bar{b}$ states. The current state of
knowledge of the $c\bar{c}$ and $b\bar{b}$ energy levels is displayed in Fig. 1.5. We shall discuss
these systems in Chapter 17.

The existence of the top quark was established in 1995 at Fermilab, in $p\bar{p}$ collisions.


1.7 Quark colour 

Much informative quark physics has been revealed in experiments with $e^+e^-$ colliding
beams. We mention here experiments in the range between centre of mass
energies 10 GeV and the threshold energy, around 90 GeV, at which the Z boson
can be produced.

The $e^+e^-$ annihilation cross-section $\sigma(e^+e^- \to \mu^+\mu^-)$ is comparatively easy
to measure, and is easy to calculate in the Weinberg-Salam electroweak theory,
which we shall introduce in Chapter 12. At centre of mass energies much below 90
GeV the cross-section is dominated by the electromagnetic process represented by
the Feynman diagram of Fig. 1.6. The muon pair are produced ‘back-to-back’ in the
centre of mass system, which for most $e^+e^-$ colliders is the laboratory system. To
leading order in the fine-structure constant $\alpha = e^2/(4\pi\varepsilon_0\hbar c)$, the differential
cross-section for producing muons moving at an angle $\theta$ with respect to unpolarised
incident beams is

$$\frac{d\sigma}{d\theta} = \frac{\pi\alpha^2}{2s}\,(1 + \cos^2\theta)\sin\theta \qquad (1.1)$$

where $s$ is the square of the centre of mass energy (see Okun, 1982, p. 205). In the
derivation of (1.1) the lepton masses are neglected. Integrating with respect to $\theta$,
the total cross-section is

$$\sigma = \frac{4\pi\alpha^2}{3s}. \qquad (1.2)$$
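The step from (1.1) to (1.2) rests on the angular integral $\int_0^\pi (1 + \cos^2\theta)\sin\theta\, d\theta = 8/3$. A minimal numerical check is sketched below using Simpson's rule from the standard library only. Natural units ($\hbar = c = 1$) are assumed, so that $s$ is in GeV$^2$ and the cross-section in GeV$^{-2}$; the function names and the sample values of $\alpha$ and $s$ are illustrative.

```python
import math


def angular_factor(theta: float) -> float:
    """Angular dependence of (1.1): (1 + cos^2 theta) sin theta."""
    return (1.0 + math.cos(theta) ** 2) * math.sin(theta)


def simpson(f, a: float, b: float, n: int = 1000) -> float:
    """Composite Simpson's rule on [a, b] with n (made even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


def total_cross_section(alpha: float, s: float) -> float:
    """Integrate (1.1) over theta from 0 to pi (natural units assumed)."""
    return math.pi * alpha ** 2 / (2.0 * s) * simpson(angular_factor, 0.0, math.pi)


if __name__ == "__main__":
    alpha, s = 1.0 / 137.0, 100.0  # illustrative: s = (10 GeV)^2
    print("integral of angular factor:", simpson(angular_factor, 0.0, math.pi))
    print("numerical sigma:", total_cross_section(alpha, s))
    print("closed form 4*pi*alpha^2/(3s):", 4.0 * math.pi * alpha ** 2 / (3.0 * s))
```

The numerical integral agrees with $8/3$, so the prefactor $\pi\alpha^2/2s$ becomes $4\pi\alpha^2/3s$.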


The quantity $R(E)$ shown in Fig. 1.7 is the ratio

$$R = \frac{\sigma(e^+e^- \to \text{strongly interacting particles})}{\sigma(e^+e^- \to \mu^+\mu^-)}. \qquad (1.3)$$

At the lower energies many hadronic states are revealed as resonances, but $R$ seems
to become approximately constant, $R \approx 4$, at energies above 10 GeV up to about
40 GeV.

Figure 1.5 Energy-level diagrams for charmonium $c\bar{c}$ and bottomonium $b\bar{b}$ states,
below the threshold at which they can decay through the strong interaction to
meson pairs (for example $c\bar{c} \to c\bar{u} + u\bar{c}$). States labelled 1S, 2S, 3S have orbital
angular momentum $L = 0$ and the 1P, 2P states have $L = 1$. The intrinsic quark
spins can couple to $S = 0$ to give states with total angular momentum $J = L$.
These states are denoted by dashed lines; experimentally they are difficult to detect.
The intrinsic quark spins can also couple to give $S = 1$. States with $S = 1$ are denoted
by full lines. Spin-orbit coupling splits the P states with $S = 1$ to give rise to states
with $J^P = 0^+, 1^+, 2^+$. This spin-orbit splitting is apparent in the figure. All the $S = 1$
states shown have been measured.

Figure 1.6 The lowest order Feynman diagram (Chapter 8) for electromagnetic
$\mu^+\mu^-$ pair production in $e^+e^-$ collisions.


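As fundamental particles, quarks have the same electrodynamics as muons, apart
from the magnitude of their electric charge. The Feynman diagrams that dominate
the numerator of $R$ in this range 10 GeV to 40 GeV are shown in Fig. 1.8. (The top
quark has a mass $\approx 174\ \text{GeV}/c^2$ and will not contribute.) For each quark process
the formula (1.2) holds, except that $e$ is replaced by the quark’s electric charge at
the quark vertex, which suggests

$$R = \left(\tfrac{2}{3}\right)^2 + \left(\tfrac{1}{3}\right)^2 + \left(\tfrac{1}{3}\right)^2 + \left(\tfrac{2}{3}\right)^2 + \left(\tfrac{1}{3}\right)^2 = \frac{11}{9}, \tag{1.4}$$

the terms being the squared charges, in units of $e$, of the u, d, s, c and b quarks.
This value is too low, by a factor of about 3.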

In the Standard Model, the discrepancy is resolved by introducing the idea of
quark colour. A quark not only has a flavour index, u, d, s, c, b, t, but also, for each
flavour, a colour index. There are postulated to be three basic states of colour, say
red, green and blue (r, g, b). With three quark colour states to each flavour, we have
to multiply the $R$ of (1.4) by 3, to obtain

$$R = \frac{11}{3}, \tag{1.5}$$

which is in excellent agreement with the data of Fig. 1.7.
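The counting behind this estimate of $R$ is short enough to spell out. The following sketch sums the squared quark charges for the five flavours that can be pair-produced between 10 GeV and 40 GeV, with and without a colour factor of 3. The dictionary and function names are illustrative, and exact fractions are used so that no rounding enters.

```python
from fractions import Fraction

# Electric charges, in units of e, of the quarks light enough to be
# pair-produced between 10 GeV and 40 GeV (the top quark is excluded).
QUARK_CHARGES = {
    "u": Fraction(2, 3),
    "d": Fraction(-1, 3),
    "s": Fraction(-1, 3),
    "c": Fraction(2, 3),
    "b": Fraction(-1, 3),
}


def r_ratio(charges, n_colours=1):
    """Lowest-order R: number of colours times the sum of squared quark charges."""
    return n_colours * sum(q**2 for q in charges.values())


r_no_colour = r_ratio(QUARK_CHARGES)
r_colour = r_ratio(QUARK_CHARGES, n_colours=3)
print(r_no_colour, r_colour)  # 11/9 11/3
```

With three colours the estimate is 11/3, about 3.7, close to the measured plateau $R \approx 4$.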

This invention of colour not only solves the problem of R but, most significantly, 
solves the problem of the symmetry of the baryon states. We have seen (Section 
1.5) that in the absence of any new quantum number baryon states are completely 
symmetric in the interchange of two quarks. However, if these state functions are
multiplied by an antisymmetric colour state function, the overall state becomes 
antisymmetric, and the Pauli principle is preserved. 

Strong support for the mechanism of quark production represented by the
Feynman diagrams of Fig. 1.8 is given by other features in the data from
$e^+e^-$ colliders. An $e^+e^-$ annihilation at high energies produces many hadrons.






Figure 1.7 Measurements of $R(E)$ from the resonance region 1 GeV < E < 11 GeV
into the region 11 GeV < E < 60 GeV, which contains no prominent resonances
and no quark-antiquark production threshold. For E > 11 GeV two curves are
shown of calculations that take account of quark colour and include electroweak
corrections and strong interaction (QCD) effects. (Adapted from Particle Data
Group (1996).)

Figure 1.8 The lowest order Feynman diagrams for quark-antiquark pair
production in $e^+e^-$ collisions at energies below the Z threshold.

Figure 1.9 An example of an $e^+e^-$ annihilation event that results in two jets of
hadrons. The figure shows the projection of the charged particle tracks onto a plane
perpendicular to the axis of the $e^+e^-$ beams. This figure was taken from an event
in the TASSO detector at PETRA, DESY.


These are mostly correlated into two back-to-back jets. An example is shown in 
Fig. 1.9. (The charged particle tracks are curved because of the presence of an 
external magnetic field: the curvature is related to the particle’s momentum.) The 
direction of a jet may be defined as the direction at the point of production of the 
total momentum of all the hadrons associated with it. The momenta of two back- 
to-back jets are equal and opposite. The jet directions may be presumed to be the 
directions of the initial quark-antiquark pair. This interpretation is corroborated by 
an examination of the angular distribution of the jet directions of two-jet events 
from many annihilations, with respect to the $e^+e^-$ beams. The angular distribution
is the same as that for muons (equation (1.1)) after allowance has been made for
the Z contribution, which becomes significant as the energy for Z production is
approached.

The hadron jets result from the original quark and antiquark combining with
quark-antiquark pairs generated from the vacuum. The precise details of the
processes involved are not yet fully understood.


1.8 Electron scattering from nucleons 

There is a clear advantage in using electrons to probe the proton and neutron, since 
electrons interact with quarks primarily through electromagnetic forces that are 
well understood: the weak interaction is negligible in the scattering process, except 
at very high energy and large scattering angle, and the strong interaction is not 
directly involved. 

In the 1950s, experiments at Stanford on nucleon targets at rest in the laboratory 
revealed the electric charge distribution in the proton and (using scattering data 
from deuterium targets) the neutron. These early experiments were performed at 
electron energies < 500 MeV (Hofstadter et al., 1958). Scattering at higher ener¬ 
gies has thrown more light on the behaviour of quarks in the proton. At these 
energies inelastic electron scattering, which involves meson production, becomes 
the dominant mode. 

At the electron-proton collider HERA at Hamburg, a beam of 30 GeV electrons 
met a beam of 820 GeV protons head on. Many features of the ensuing electron- 
proton collisions are well described by the parton model, which was introduced 
by Feynman in 1969. In the parton model each proton in the beam is regarded as 
a system of sub-particles called partons. These are quarks, antiquarks and gluons. 
Quarks and antiquarks are the particles that carry electric charge. The basic idea of 
the parton model is that at high energy-momentum transfer $Q^2$, an electron scatters
from an effectively free quark or antiquark and the scattering process is completed
before the recoiling quark or antiquark has time to interact with its environment of
quarks, antiquarks and gluons. Thus in the calculation of the inclusive cross-section
the final hadronic states do not appear.

In the model, at large $Q^2$ both the electron and the struck quark are deflected
through large angles. Figure 1.10 shows an example of an event from the ZEUS
detector at HERA. The transverse momentum of the scattered electron is balanced
by a jet of hadrons, which can be associated with the recoiling quark. Another jet,
the ‘proton remnant’ jet, is confined to small angles with respect to the proton beam.
Events like these give further strong support to the parton model. 

The success of the parton model in interpreting the data gives added support to 
the concept of quarks. The parton model is not strictly part of our main theme but, 
in view of its interest and importance in particle physics, a simple account of the 
model and its relation to experiment is given in Appendix D.

Figure 1.10 This figure illustrating particle tracks is taken from an event in the
ZEUS detector at HERA, DESY. Figure 1.10(a) is the event projected onto a plane
perpendicular to the axis of the beams. Figure 1.10(b) is the event projected onto
a plane passing through the axis of the beams.

A hadron jet has been ejected from the proton by an electron. The track of the
recoiling electron is marked e. The initiating beams and the proton remnant jet are
confined to the beam pipes and are not detected.


1.9 Particle accelerators 

Progress in our understanding of Nature has come through the interplay between
theory and experiment. In particle physics, experiment now depends primarily on
the great particle accelerators and ingenious and complex particle detectors, which
have been built, beginning in the early 1930s with the Cockcroft-Walton linear
accelerator at Cambridge, UK, and Lawrence’s cyclotron at Berkeley, USA. The
Cambridge machine accelerated protons to 0.7 MeV; the first Berkeley cyclotron
accelerated protons to 1.2 MeV. For a time after 1945 important results were
obtained using cosmic radiation as a source of high energy particles, events
being detected in photographic emulsion, but in the 1950s new accelerators
provided beams of particles of increasingly high energies. Some of the machines,
past, present and future, are listed in Table 1.5. Detailed parameters of these
machines, and of others, may be found in Particle Data Group (2005).

Table 1.5. Some particle accelerators

Machine                             Particles collided                      Start date-end date
TEVATRON (Fermilab, Batavia, IL)    $p$: 900 GeV, $\bar{p}$: 900 GeV        1987
SLC (SLAC, Stanford)                $e^+$: 50 GeV, $e^-$: 50 GeV            1989-1998
HERA (DESY, Hamburg)                $e$: 30 GeV, $p$: 820 GeV               1992
LEP2 (CERN, Geneva)                 $e^+$: 81 GeV, $e^-$: 81 GeV            1996-2000
PEP-II (SLAC, Stanford)             $e^-$: 9 GeV, $e^+$: 3.1 GeV            1999-2008
LHC (CERN, Geneva)                  $p$: 7 TeV, $p$: 7 TeV                  2008

The TEVATRON at Fermilab is where the top quark was discovered. The physics
of the top quark is as yet little explored. It makes only a brief appearance in our text,
though it is an essential part of the pattern of the Standard Model. The upgraded
LEP2 at CERN is able to create $W^+W^-$ pairs, and will allow detailed studies of
the weak interaction. At Stanford, PEP-II and the associated ‘BaBar’ ($B\bar{B}$) detector
are designed to study charge conjugation-parity (CP) violation. The way in which
CP violation appears in the Standard Model is discussed in Chapter 18.

The most ambitious machine likely to be built in the immediate future is the 
Large Hadron Collider (LHC) at CERN. It is expected that with this machine it will 
be possible to observe the Higgs boson, if such a particle exists. The Higgs boson is 
an essential component of the Standard Model; we introduce it in Chapter 10. It is 
also widely believed that the physics of Supersymmetry, which perhaps underlies 
the Standard Model, will become apparent at the energies, up to 14 TeV, which will 
be available at the LHC. 


1.10 Units 

In particle physics it is usual to simplify the appearance of equations by using units
in which $\hbar = 1$ and $c = 1$. In electromagnetism we set $\varepsilon_0 = 1$ (so that the force
between charges $q_1$ and $q_2$ is $q_1 q_2/4\pi r^2$), and $\mu_0 = 1$, to give
$c^2 = (\mu_0\varepsilon_0)^{-1} = 1$.

We shall occasionally reinsert factors of $\hbar$ and $c$ where it may be reassuring
or illuminating, or for the purposes of calculation. It is useful to remember
that

$$\hbar c \approx 197\ \text{MeV fm}, \qquad e^2/4\pi \approx 1.44\ \text{MeV fm},$$
$$\alpha = e^2/4\pi\hbar c \approx 1/137, \qquad c \approx 3 \times 10^{23}\ \text{fm s}^{-1}.$$
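As an illustration of how these conversion constants are used, the sketch below reinserts $\hbar$ and $c$ to turn a mass into a length and a decay width into a lifetime. The rounded constants are the ones quoted above; the pion mass of 139.57 MeV and the 15 MeV width are example inputs assumed for the demonstration, and the function names are hypothetical.

```python
HBAR_C = 197.0        # hbar * c in MeV fm (rounded value quoted in the text)
E2_OVER_4PI = 1.44    # e^2 / 4 pi in MeV fm (rounded value quoted in the text)
C_FM_PER_S = 3.0e23   # speed of light in fm per second

# Fine-structure constant from the two rounded constants: roughly 1/137.
alpha = E2_OVER_4PI / HBAR_C


def mass_to_length_fm(mass_mev):
    """Reduced Compton wavelength hbar/(m c), in fm, for a mass given in MeV/c^2."""
    return HBAR_C / mass_mev


def width_to_lifetime_s(width_mev):
    """Lifetime hbar/Gamma, in seconds, for a decay width given in MeV."""
    return HBAR_C / width_mev / C_FM_PER_S


print(1 / alpha)                   # close to 137
print(mass_to_length_fm(139.57))   # charged pion mass (assumed input): about 1.4 fm
print(width_to_lifetime_s(15.0))   # a 15 MeV width corresponds to a few 1e-23 s
```

In natural units a mass, an inverse length and an inverse time are the same kind of quantity; the constants above simply restore conventional units at the end of a calculation.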

Energies, masses and momenta are usually quoted in MeV or GeV, and we shall
follow this convention.


Lorentz transformations


The equations of the Standard Model must be consistent with Einstein’s principle 
of relativity, which states that the laws of Nature take the same form in every 
inertial frame of reference. An inertial frame is one in which a free body moves 
without acceleration. An earth-bound frame approximates to an inertial frame if the 
gravitational field of the earth is introduced as an external field. We shall assume 
that the reader is familiar with rotations, and with proper Lorentz transformations 
and the relativistic mechanics of particle collisions. This chapter is very largely 
about notation, which may make for dry reading; however an appropriate notation 
is crucial to the exposition of any theory, and particularly so to a relativistic theory, 
such as the Standard Model. 


2.1 Rotations, boosts and proper Lorentz transformations 

The time and space coordinates of an event measured in different inertial frames
of reference are related by a Lorentz transformation. A rotation is a special case of
a Lorentz transformation. Consider, for example, a frame $K'$ that is rotated about
the z-axis with respect to a frame $K$, by an angle $\theta$. If $(t, \mathbf{r})$ are the time and space
coordinates of an event observed in $K$, then in $K'$ the event is observed at $(t', \mathbf{r}')$
and

$$\begin{aligned}
t' &= t \\
x' &= x\cos\theta + y\sin\theta \\
y' &= -x\sin\theta + y\cos\theta \\
z' &= z.
\end{aligned} \tag{2.1}$$

Lorentz transformations also relate events observed in frames of reference that
are moving with constant velocity, one with respect to the other. Consider, for
example, an inertial frame $K'$ moving in the z-direction in a frame $K$ with velocity
$v$, the spatial axes of $K$ and $K'$ being coincident at $t = 0$. If $(t, \mathbf{r})$ are the time and
space coordinates of an event observed in $K$, and $(t', \mathbf{r}')$ are the coordinates of the
same event observed in $K'$, the transformation takes the form

$$\begin{aligned}
ct' &= \gamma(ct - \beta z) \\
x' &= x \\
y' &= y \\
z' &= \gamma(z - \beta ct),
\end{aligned} \tag{2.2}$$

where $c$ is the velocity of light, $\beta = v/c$, $\gamma = (1 - \beta^2)^{-1/2}$.

Putting $x^0 = ct$, $x^1 = x$, $x^2 = y$, $x^3 = z$, the $x^\mu$ are dimensionally
homogeneous, and an event in $K$ is specified by the set $x^\mu$, where $\mu = 0, 1, 2, 3$. Greek
indices in the text will in general take these values. With this more convenient
notation, we may write the Lorentz transformation (2.2) as

$$\begin{aligned}
x'^0 &= x^0\cosh\theta - x^3\sinh\theta \\
x'^1 &= x^1 \\
x'^2 &= x^2 \\
x'^3 &= -x^0\sinh\theta + x^3\cosh\theta,
\end{aligned} \tag{2.3}$$

where we have put $\beta = v/c = \tanh\theta$; then $\gamma = \cosh\theta$.
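The hyperbolic form of the boost is convenient in practice because the parameter $\theta$ (often called the rapidity, a name assumed here) simply adds when two boosts along the same axis are combined. The sketch below is a minimal illustration with hypothetical function names and an arbitrary example event: it applies a boost along the z-axis, checks that the combination $(x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2$ is unchanged, and checks that two successive boosts with $\beta = 0.6$ are equivalent to one boost with $\beta = \tanh(2\theta)$.

```python
import math


def boost_z(event, beta):
    """Boost along z with velocity beta = v/c, written with cosh and sinh.

    event is the tuple (x0, x1, x2, x3) with x0 = c t.
    """
    theta = math.atanh(beta)  # tanh(theta) = beta, so cosh(theta) = gamma
    x0, x1, x2, x3 = event
    return (
        x0 * math.cosh(theta) - x3 * math.sinh(theta),
        x1,
        x2,
        -x0 * math.sinh(theta) + x3 * math.cosh(theta),
    )


def interval(event):
    """Interval between the event and the origin."""
    x0, x1, x2, x3 = event
    return x0**2 - x1**2 - x2**2 - x3**2


event = (5.0, 1.0, -2.0, 3.0)       # arbitrary example event
boosted = boost_z(event, 0.6)       # gamma = 1.25 for beta = 0.6

# Two successive boosts with beta = 0.6 equal one boost whose theta is doubled.
twice = boost_z(boost_z(event, 0.6), 0.6)
beta_combined = math.tanh(2 * math.atanh(0.6))
once = boost_z(event, beta_combined)

print(interval(event), interval(boosted))
print(beta_combined)  # (0.6 + 0.6) / (1 + 0.36), the velocity addition formula
```

Velocities do not add linearly, but the hyperbolic angles do, which is one reason the form with $\cosh\theta$ and $\sinh\theta$ is often preferred.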

Transformations to a frame with parallel axes but moving in an arbitrary
direction are called boosts. A general Lorentz transformation between inertial frames $K$
and $K'$ whose origins coincide at $x^0 = x'^0 = 0$ is a combination of a rotation and
a boost. It is specified by six parameters: three parameters to give the orientation
of the $K'$ axes relative to the $K$ axes, and three parameters to give the
components of the velocity of $K'$ relative to $K$. Such a general transformation is of the
form

$$x'^\mu = L^\mu{}_\nu\, x^\nu, \tag{2.4}$$

where the elements $L^\mu{}_\nu$ of the transformation matrix are real and dimensionless.
We use here, and subsequently, the Einstein summation convention: a repeated
‘dummy’ index is understood to be summed over, so that in (2.4) the notation
$\sum_{\nu=0}^{3}$ has been omitted on the right-hand side. The matrices $L^\mu{}_\nu$ form a group,
called the proper Lorentz group (Problem 2.6 and Appendix B). The significance
of the placing of the superscript and the subscript will become evident shortly.

The interval $(\Delta s)^2$ between events $x^\mu$ and $x^\mu + \Delta x^\mu$ is defined to be

$$(\Delta s)^2 = (\Delta x^0)^2 - (\Delta x^1)^2 - (\Delta x^2)^2 - (\Delta x^3)^2. \tag{2.5}$$

It is a fundamental property of a Lorentz transformation that it leaves the interval
between two events invariant:

$$(\Delta s')^2 = (\Delta s)^2. \tag{2.6}$$

We can express $(\Delta s)^2$ more compactly by introducing the metric tensor $(g_{\mu\nu})$:

$$(g_{\mu\nu}) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}. \tag{2.7}$$

Then

$$(\Delta s)^2 = g_{\mu\nu}\,\Delta x^\mu \Delta x^\nu, \tag{2.8}$$

where the repeated upper and lower indices are summed over. Note that $g_{\mu\nu} = g_{\nu\mu}$;
it is a symmetric tensor. It has the same elements in every frame of reference.


2.2 Scalars, contravariant and covariant four-vectors 

Quantities, such as $(\Delta s)^2$, which are invariant under Lorentz transformations are
called scalars. We define a contravariant four-vector to be a set $a^\mu$ which transforms
like the set $x^\mu$ under a proper Lorentz transformation:

$$a'^\mu = L^\mu{}_\nu\, a^\nu. \tag{2.9}$$

A familiar example of a contravariant four-vector is the energy-momentum vector
of a particle $(E/c, \mathbf{p})$.

We define the corresponding covariant four-vector $a_\mu$, carrying a subscript,
rather than a superscript, by

$$a_\mu = g_{\mu\nu}\, a^\nu. \tag{2.10}$$

Hence if $a^\mu = (a^0, \mathbf{a})$, then $a_\mu = (a^0, -\mathbf{a})$.

We can write the invariant $\Delta s^2$ as

$$\Delta s^2 = g_{\mu\nu}\,\Delta x^\mu \Delta x^\nu = \Delta x_\nu \Delta x^\nu.$$

More generally, if $a^\mu$, $b^\mu$ are contravariant four-vectors, the scalar product

$$g_{\mu\nu}\, a^\mu b^\nu = a_\nu b^\nu = a^\mu b_\mu = a^0 b^0 - \mathbf{a}\cdot\mathbf{b} \tag{2.11}$$

is invariant under a Lorentz transformation.

We can define the contravariant metric tensor $g^{\mu\nu}$ so that

$$a^\mu = g^{\mu\nu}\, a_\nu. \tag{2.12}$$

The elements of $g^{\mu\nu}$ are evidently identical to those of $g_{\mu\nu}$.

The transformation law for covariant vectors, which we write

$$a'_\mu = L_\mu{}^\nu\, a_\nu, \tag{2.13}$$

follows from that for contravariant vectors (Problem 2.1). Note that, in general,
$L_\mu{}^\nu$ is not equal to $L^\nu{}_\mu$ (Problem 2.1). Using the invariance of the scalar product
(2.11), we have

$$a'_\mu b'^\mu = L_\mu{}^\nu L^\mu{}_\rho\, a_\nu b^\rho = a_\nu b^\nu$$

and

$$a'^\mu b'_\mu = L^\mu{}_\nu L_\mu{}^\rho\, a^\nu b_\rho = a^\nu b_\nu.$$

Since the $a^\mu$ and $b^\mu$ are arbitrary, it follows that

$$L_\mu{}^\nu L^\mu{}_\rho = L^\nu{}_\mu L_\rho{}^\mu = \delta^\nu_\rho, \tag{2.14}$$

where

$$\delta^\nu_\rho = \begin{cases} 1, & \rho = \nu \\ 0, & \rho \neq \nu. \end{cases}$$

2.3 Fields 

The Standard Model is a theory of fields. We shall be concerned with fields that at 
each point x of space and time transform as scalars, or vectors, or tensors (defined 
later in this section). We use $x$ to stand for the set $(x^0, x^1, x^2, x^3)$. For example,
we shall see that the electromagnetic potentials form a four-vector field, and the
electromagnetic field is a tensor field. We shall also be concerned with scalar fields
$\phi(x)$, which by definition transform simply as

$$\phi'(x') = \phi(x), \tag{2.15}$$

where $x'$ and $x$ refer to the same point in space-time.

We can construct a vector field from a scalar field. Consider the change of field
$d\phi$ in moving from $x$ to a neighbouring point $x + dx$, with $dx$ infinitesimal. Then

$$d\phi = \frac{\partial\phi}{\partial x^\mu}\, dx^\mu$$

is invariant under a Lorentz transformation. Since the set $dx^\mu$ make up an arbitrary
contravariant infinitesimal vector, the set $\partial\phi/\partial x^\mu$ must make up a covariant vector
(Problem 2.3). Following the subscript convention we write

$$\frac{\partial\phi}{\partial x^\mu} = \left(\frac{1}{c}\frac{\partial\phi}{\partial t},\ \nabla\phi\right) = \partial_\mu\phi. \tag{2.16}$$

We can then also define the contravariant vector

$$\partial^\mu\phi = g^{\mu\nu}\,\partial_\nu\phi = \frac{\partial\phi}{\partial x_\mu} = \left(\frac{1}{c}\frac{\partial\phi}{\partial t},\ -\nabla\phi\right). \tag{2.17}$$

It follows that

$$\partial_\mu\phi\,\partial^\mu\phi = \frac{1}{c^2}\left(\frac{\partial\phi}{\partial t}\right)^2 - (\nabla\phi)^2 \tag{2.18}$$

and

$$\partial_\mu\partial^\mu\phi = \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} - \nabla^2\phi \tag{2.19}$$

are invariant under Lorentz transformations.

We can define, and we shall need, tensor quantities. Tensors $T^{\mu\nu}$, $T_{\mu\nu}$, $T^\mu{}_\nu$,
$T^{\mu\nu}{}_\lambda$, etc., are defined as quantities which transform under a Lorentz
transformation in the same way as $a^\mu a^\nu$, $a_\mu a_\nu$, $a^\mu a_\nu$, $a^\mu a^\nu a_\lambda$, etc. For example,

$$T'^{\mu\nu} = L^\mu{}_\rho L^\nu{}_\lambda\, T^{\rho\lambda}.$$

The ‘contraction’ by summation of a repeated upper and lower index leaves
the transformation properties determined by what remains. For example, $T^\mu{}_\mu$ is a
scalar, $T^{\mu\nu}{}_\mu$ is a contravariant four-vector. The metric tensors $g_{\mu\nu}$, $g^{\mu\nu}$ conform
with the definition, and this leads to the conditions on the matrix elements $L^\mu{}_\nu$:

$$g_{\mu\nu} = g_{\rho\lambda}\, L^\rho{}_\mu L^\lambda{}_\nu. \tag{2.20}$$

The conditions (2.20) and (2.14) are equivalent.
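In matrix language, writing $L$ for the array with elements $L^\mu{}_\nu$ and $G$ for the metric, condition (2.20) reads $L^{T} G L = G$, and the covariant transformation matrix of Problem 2.1 is $G L G$. The sketch below checks both conditions numerically for one example, a rotation about the z-axis followed by a boost along z. It is an illustration only: the helper names, the rotation angle and the velocity are choices made here, and plain nested lists are used so that nothing beyond the standard library is needed.

```python
import math

# Metric tensor; its covariant and contravariant forms have the same elements.
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def transpose(a):
    return [list(row) for row in zip(*a)]


def rotation_z(angle):
    """Rotation of the axes about z; rows and columns are ordered (0, 1, 2, 3)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1, 0, 0, 0], [0, c, s, 0], [0, -s, c, 0], [0, 0, 0, 1]]


def boost_z(beta):
    """Boost along z with velocity beta = v/c."""
    gamma = 1 / math.sqrt(1 - beta**2)
    return [[gamma, 0, 0, -beta * gamma], [0, 1, 0, 0], [0, 0, 1, 0], [-beta * gamma, 0, 0, gamma]]


def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(4) for j in range(4))


# L[mu][nu] holds the element with upper index mu and lower index nu.
L = matmul(boost_z(0.6), rotation_z(0.3))

# Condition (2.20) in matrix form: L^T G L = G.
lhs_220 = matmul(transpose(L), matmul(G, L))

# Covariant transformation matrix (Problem 2.1): lower index first, upper second.
L_cov = matmul(G, matmul(L, G))

# Condition (2.14) in matrix form: L_cov^T L = identity.
lhs_214 = matmul(transpose(L_cov), L)

print(close(lhs_220, G), close(lhs_214, IDENTITY))
```

Since $G^2$ is the identity, multiplying $L^{T} G L = G$ on the left by $G$ gives $(G L G)^{T} L = 1$, which is the matrix form of the equivalence stated above.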

As well as scalars, vectors and tensors there are also very important objects
called spinors, and spinor fields, which have well-defined rules of transformation
under a Lorentz transformation of the coordinates. Their properties are discussed
in Appendix B and Chapter 5.


2.4 The Levi-Civita tensor 


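The Levi-Civita tensor $\varepsilon_{\mu\nu\lambda\rho}$ is defined by

$$\varepsilon_{\mu\nu\lambda\rho} = \begin{cases}
+1 & \text{if } \mu, \nu, \lambda, \rho \text{ is an even permutation of } 0, 1, 2, 3; \\
-1 & \text{if } \mu, \nu, \lambda, \rho \text{ is an odd permutation of } 0, 1, 2, 3; \\
0 & \text{otherwise.}
\end{cases}$$

For example, $\varepsilon_{1023} = -1$, $\varepsilon_{1203} = +1$, $\varepsilon_{0023} = 0$.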
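The definition translates directly into a short routine. The sketch below (the function name is hypothetical) returns 0 when the indices are not a permutation of 0, 1, 2, 3 and otherwise finds the sign by sorting the indices with swaps, each swap contributing a factor of $-1$.

```python
def levi_civita(*indices):
    """Return +1, -1 or 0 for the Levi-Civita symbol with the given indices.

    The indices are compared with the reference order 0, 1, ..., n-1.
    """
    n = len(indices)
    if sorted(indices) != list(range(n)):
        return 0  # a repeated or out-of-range index gives zero
    sign = 1
    perm = list(indices)
    for i in range(n):
        # Swap entries until position i holds the value i; each swap flips the sign.
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign


print(levi_civita(1, 0, 2, 3), levi_civita(1, 2, 0, 3), levi_civita(0, 0, 2, 3))  # -1 1 0
```

The three printed values reproduce the examples given for the index sets 1023, 1203 and 0023.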

It is straightforward to verify that $\varepsilon_{\mu\nu\lambda\rho}$ satisfies

$$\varepsilon'_{\mu\nu\lambda\rho} = L_\mu{}^\alpha L_\nu{}^\beta L_\lambda{}^\gamma L_\rho{}^\delta\, \varepsilon_{\alpha\beta\gamma\delta} = \varepsilon_{\mu\nu\lambda\rho}\det(L) = \varepsilon_{\mu\nu\lambda\rho}, \tag{2.21}$$

using the definition of a determinant (Appendix A), and the result that the
determinant of the transformation matrix is 1 (Problems 2.4 and 2.5).

The corresponding Levi-Civita symbol in three dimensions, $\varepsilon_{ijk}$, is defined
similarly. It is useful in the construction of volumes, since

$$\varepsilon_{ijk} A^i B^j C^k = \mathbf{A}\cdot(\mathbf{B}\times\mathbf{C})$$

is the volume of the parallelepiped defined by the vectors $\mathbf{A}$, $\mathbf{B}$, $\mathbf{C}$. The
four-dimensional Levi-Civita tensor enables one to construct four-dimensional volumes
$\varepsilon_{\mu\nu\lambda\rho}\, a^\mu b^\nu c^\lambda d^\rho$. The contraction of indices leaves this a Lorentz scalar. In
particular, taking $a, b, c, d$ to be infinitesimal elements parallel to the axes $0x^\mu$ so that
$a = (dx^0, 0, 0, 0)$, $b = (0, dx^1, 0, 0)$, $c = (0, 0, dx^2, 0)$, $d = (0, 0, 0, dx^3)$, it
follows that the ‘volume’ element of space-time

$$d^4x = dx^0\,dx^1\,dx^2\,dx^3 = c\,d^3x\,dt$$

is a Lorentz invariant scalar (see also Problem 2.9).


2.5 Time reversal and space inversion 

The operations of time reversal:

$$x'^0 = -x^0, \qquad x'^i = x^i, \quad i = 1, 2, 3,$$

and space inversion:

$$x'^0 = x^0, \qquad x'^i = -x^i, \quad i = 1, 2, 3,$$

also leave $(\Delta s)^2$ invariant, but these transformations are excluded from the proper
Lorentz group. They are however of interest, and will arise in later chapters.

Problems 

2.1 Show that $L_\mu{}^\nu = g_{\mu\rho}\, L^\rho{}_\lambda\, g^{\lambda\nu}$. Verify $L_0{}^1 = -L^0{}_1$.

2.2 Using (2.14), show that the inverse transformations to (2.9) and (2.13) are

$$a^\mu = a'^\nu L_\nu{}^\mu, \qquad a_\mu = a'_\nu L^\nu{}_\mu.$$

Hence show

2.3 Prove that if $\phi(x)$ is a scalar field, the set $(\partial\phi/\partial x^\mu)$ makes up a covariant vector
field.



2.4 Using Problem 2.1, show that $\det(L^\mu{}_\nu) = \det(L_\mu{}^\nu)$ and hence show, using equation
(2.14), that

$$\det(L^\mu{}_\nu) = \pm 1.$$

2.5 Show that $\det(L^\mu{}_\nu)$ for both the rotation (2.1) and the boost (2.3) is equal to +1.
This is a general property of proper Lorentz transformations that distinguishes them
from space reflections and time reversal (Section 2.5), for which the determinant of
the transformation equals $-1$.

2.6 Show that the matrices $L^\mu{}_\nu$ corresponding to proper Lorentz transformations form
a group.

2.7 Show that is a tensor. 

2.8 The frequency $\omega$ and wave vector $\mathbf{k}$ of an electromagnetic wave in free space make
up a contravariant four-vector

$$k^\mu = (\omega/c, \mathbf{k}).$$

The invariant $k_\mu k^\mu = 0$; this corresponds to the dispersion relation $\omega^2 = c^2 k^2$. Show
that a wave propagating with frequency $\omega$ in the z-direction, if viewed from a frame
moving along the z-axis with velocity $v$, is seen to be Doppler shifted in frequency,
with

$$\omega' = e^{-\theta}\omega = \sqrt{\frac{1 - v/c}{1 + v/c}}\ \omega.$$

2.9 By considering the Jacobian of the Lorentz transformation, show that the
four-dimensional volume element $d^4x = dx^0\,dx^1\,dx^2\,dx^3$ is a Lorentz invariant.

2.10 Show that $\varepsilon_{\mu\nu\lambda\rho}$ is a pseudo-tensor, i.e. it changes sign under the operation of space
inversion.


The Lagrangian formulation of mechanics


In most introductory texts on quantum mechanics you will find ‘Hamiltonian’ in the 
index (see our equation (3.8)) but you are less likely to find ‘Lagrangian’. However, 
quantum field theories are most conveniently described in a Lagrangian formalism, 
to which this chapter is an introduction. 


3.1 Hamilton’s principle 


The classical dynamics of a mechanical (non-dissipative) system is most elegantly
derived from Hamilton’s principle. A closed mechanical system is completely
characterised by its Lagrangian $L(q, \dot q)$: the variables $q(t)$, which are functions of time,
are a set of coordinates $q_1(t), q_2(t), \ldots, q_s(t)$ which determine the configuration of
the system at time $t$. In particular, the $q_i$ might be the Cartesian coordinates of a set
of interacting particles. We restrict our discussion to the case where all the $q_i(t)$ are
independent. In non-relativistic mechanics we take $L = T - V$, where $T(q, \dot q)$ is
the kinetic energy of the system and $V(q)$ is its potential energy.

Given $L$, the action $S$ is defined by

$$S = \int_{t_1}^{t_2} L(q, \dot q)\, dt. \tag{3.1}$$

The value of $S$ depends on the path of integration in $q$-space. The end-points of
the path are fixed at times $t_1$ and $t_2$, but the path is otherwise unrestricted. $S$ is
said to be a functional of $q(t)$. Hamilton’s principle states that $S$ is stationary for
that particular path in $q$-space determined by the equations of motion, so that if we
consider a variation to an arbitrary neighbouring path (Fig. 3.1), $\delta S = 0$, where

$$\delta S = \delta\int_{t_1}^{t_2} L\, dt = \int_{t_1}^{t_2} \sum_i \left[\frac{\partial L}{\partial q_i}\,\delta q_i + \frac{\partial L}{\partial \dot q_i}\,\delta\dot q_i\right] dt.$$

Figure 3.1 A schematic representation of the path in $q$-space determined by the
equations of motion (full line) and a neighbouring path (dashed line).

Since $\delta\dot q = d(\delta q)/dt$, we can integrate the second term in this integral by parts, to
give

$$\delta S = \int_{t_1}^{t_2} \sum_i \left[\frac{\partial L}{\partial q_i} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right)\right] \delta q_i\, dt. \tag{3.2}$$

The ‘end-point’ contributions from the integration by parts are zero, since
$\delta q(t_1) = \delta q(t_2) = 0$.

The variations $\delta q_i(t)$ are arbitrary. It follows from (3.2) that the condition $\delta S = 0$
requires

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right) - \frac{\partial L}{\partial q_i} = 0, \qquad i = 1, \ldots, s. \tag{3.3}$$

These are the Euler-Lagrange equations of motion. In classical non-relativistic
mechanics they are equivalent to Newton’s equations of motion. As a simple
example, consider a particle of mass $m$ moving in one dimension in a potential $V(x)$. Then
$L = T - V = (m\dot x^2/2) - V(x)$. From (3.3) we have immediately $m\ddot x = -\partial V/\partial x$,
which is Newton’s equation of motion for the particle.
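Hamilton's principle can be illustrated numerically for this one-dimensional example. The sketch below assumes a harmonic potential $V(x) = kx^2/2$ with illustrative values of $m$ and $k$, takes the exact solution $x(t) = \sin\omega t$ as the true path, and adds a variation $\varepsilon\sin(\pi t)$ that vanishes at both end-points. The discretised action should then have no term linear in $\varepsilon$. All names and parameter values are assumptions made for the demonstration.

```python
import math

M, K = 1.0, 4.0            # mass and spring constant (illustrative values)
OMEGA = math.sqrt(K / M)   # angular frequency of the true motion
T1, T2 = 0.0, 1.0          # fixed end-point times


def path(t, eps):
    """True path sin(omega t) plus a variation that vanishes at t1 and t2."""
    return math.sin(OMEGA * t) + eps * math.sin(math.pi * (t - T1) / (T2 - T1))


def action(eps, n=4000):
    """Approximate S, the time integral of m x'^2 / 2 - k x^2 / 2, on n steps."""
    dt = (T2 - T1) / n
    total = 0.0
    for i in range(n):
        xa = path(T1 + i * dt, eps)
        xb = path(T1 + (i + 1) * dt, eps)
        velocity = (xb - xa) / dt
        x_mid = 0.5 * (xa + xb)
        total += (0.5 * M * velocity**2 - 0.5 * K * x_mid**2) * dt
    return total


eps = 1e-3
# Coefficient of the term linear in eps: should vanish for the true path.
first_order = (action(eps) - action(-eps)) / (2 * eps)
# Coefficient of the quadratic term (times 2): need not vanish.
second_order = (action(eps) + action(-eps) - 2 * action(0.0)) / eps**2
print(first_order, second_order)
```

The first number is zero up to discretisation error, while the second is finite: the action is stationary, to first order, on the path that satisfies the equation of motion.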

An external, and possibly time-dependent, field can be included in the Lagrangian
formalism through a time-dependent potential. In our one-dimensional example
above, $V(x)$ may be replaced by $V(x, t)$. Making the Lagrangian $L$ depend explicitly
on $t$ does not affect the derivation of the field equations.

It is important to note that the Lagrangian of a given system is not unique: we
can add to $L$ any function of the form $df(q, t)/dt$ where $f(q, t)$ is an arbitrary function
of $q$ and $t$. Such a term gives a contribution $[f(q_2, t_2) - f(q_1, t_1)]$ to $S$, independent
of the path, and hence leaves the equations of motion unchanged.


3.2 Conservation of energy 


In the case of a closed system of particles, interacting only among themselves, the 
equations of motion of the system do not depend explicitly on the time t, since the 
physics of a closed system does not depend on our choice of the origin of time. 
There is no reason to doubt that the laws of physics at the time of Archimedes, or 
the time of Newton, were the same as they are for us. Hence for a closed system 
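we must be able to construct a Lagrangian $L(q, \dot q)$ that does not depend explicitly
on $t$. For such a Lagrangian,

$$\frac{dL}{dt} = \sum_i \left[\frac{\partial L}{\partial q_i}\,\dot q_i + \frac{\partial L}{\partial \dot q_i}\,\ddot q_i\right].$$

Taking the $q_i(t)$ to obey the equations of motion and substituting for $\partial L/\partial q_i$ from
(3.3) we obtain

$$\frac{dL}{dt} = \sum_i \left[\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right)\dot q_i + \frac{\partial L}{\partial \dot q_i}\,\ddot q_i\right] = \frac{d}{dt}\sum_i \frac{\partial L}{\partial \dot q_i}\,\dot q_i,$$

or

$$\frac{d}{dt}\left(\sum_i \frac{\partial L}{\partial \dot q_i}\,\dot q_i - L\right) = 0. \tag{3.4}$$

Thus

$$E = \sum_i \frac{\partial L}{\partial \dot q_i}\,\dot q_i - L \tag{3.5}$$

remains constant during the motion, and is called the energy of the system. This
result exemplifies Noether’s theorem (Section 1.2): we have here a conservation
law stemming from the symmetry of the Lagrangian under a translation in time.

For a closed system of non-relativistic particles, with a potential function
$V(q_i)$, $\partial L/\partial \dot q_i = \partial T/\partial \dot q_i$. Since the kinetic energy $T$ is a quadratic function of
the $\dot q_i$ (Problem 3.1), $\sum_i (\partial T/\partial \dot q_i)\,\dot q_i = 2T$. Hence

$$E = 2T - (T - V) = T + V.$$

We recover the result of elementary mechanics.

The generalised momenta, $p_i$, are defined by

$$p_i = \frac{\partial L}{\partial \dot q_i}. \tag{3.6}$$

The Hamiltonian of a system is defined by

$$H(p, q) = \sum_i p_i \dot q_i - L. \tag{3.7}$$

In terms of $p$ and $q$, the energy equation (3.5) for a closed system becomes

$$H(p, q) = E. \tag{3.8}$$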

This equation, which is a consequence of the homogeneity of time, is a foundation 
stone for making the transition from classical to quantum mechanics. 
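The constancy of the energy can be seen in a small numerical experiment. The sketch below is an illustration under stated assumptions: a single particle in an anharmonic potential $V(q) = q^2/2 + q^4/4$ chosen purely for the example, equations of motion integrated with the velocity Verlet scheme (a standard integrator, not something taken from the text), and the energy evaluated as $p\dot q - L$ with $p = \partial L/\partial\dot q = m\dot q$. The function names are hypothetical.

```python
import math

M = 1.0  # particle mass (illustrative)


def potential(q):
    """Illustrative anharmonic potential; any time-independent V(q) would do."""
    return 0.5 * q**2 + 0.25 * q**4


def force(q):
    """Minus the derivative of the potential above."""
    return -(q + q**3)


def lagrangian(q, qdot):
    return 0.5 * M * qdot**2 - potential(q)


def energy(q, qdot):
    """E = p * qdot - L, with the generalised momentum p = dL/d(qdot) = M * qdot."""
    p = M * qdot
    return p * qdot - lagrangian(q, qdot)


def evolve(q, qdot, dt, steps):
    """Integrate the equation of motion with velocity Verlet; return the energies."""
    energies = [energy(q, qdot)]
    for _ in range(steps):
        qdot_half = qdot + 0.5 * dt * force(q) / M
        q = q + dt * qdot_half
        qdot = qdot_half + 0.5 * dt * force(q) / M
        energies.append(energy(q, qdot))
    return energies


energies = evolve(q=1.0, qdot=0.0, dt=1e-3, steps=10000)
drift = max(energies) - min(energies)
print(energies[0], drift)
```

The initial energy is 0.75 and the spread over the whole run stays tiny compared with it, the residual being integration error. For this Lagrangian $p\dot q - L$ is the same as $T + V$.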


3.3 Continuous systems 


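To see how Hamilton’s principle may be extended to continuous systems, we
consider a flexible string, of mass $\rho$ per unit length, stretched under tension $F$ between
two fixed points at $x = 0$ and $x = l$, say, but subject to small transverse
displacements in a plane. Gravity is neglected. If $\phi(x, t)$ is the transverse displacement from
equilibrium of an element $dx$ of the string at $x$, at time $t$, then the length of the string
is

$$\int_0^l (dx^2 + d\phi^2)^{1/2} = \int_0^l \left[1 + (\partial\phi/\partial x)^2\right]^{1/2} dx.$$

To leading order in $\partial\phi/\partial x$, which we take to be small for small displacements,
the extension of the string is $\int_0^l \frac{1}{2}(\partial\phi/\partial x)^2\, dx$, and the potential energy of
stretching under the tension $F$ is $\int_0^l \frac{1}{2}F(\partial\phi/\partial x)^2\, dx$. The kinetic energy of the string is
$\int_0^l \frac{1}{2}\rho(\partial\phi/\partial t)^2\, dx$. Hence

$$L = T - V = \int_0^l \left[\frac{1}{2}\rho\left(\frac{\partial\phi}{\partial t}\right)^2 - \frac{1}{2}F\left(\frac{\partial\phi}{\partial x}\right)^2\right] dx = \int_0^l \mathcal{L}\, dx, \tag{3.9}$$

where

$$\mathcal{L} = \frac{1}{2}\rho\left(\frac{\partial\phi}{\partial t}\right)^2 - \frac{1}{2}F\left(\frac{\partial\phi}{\partial x}\right)^2 \tag{3.10}$$

is called the Lagrangian density.

The corresponding action is

$$S = \int_0^l dx \int_{t_1}^{t_2} dt\ \mathcal{L}(\dot\phi, \phi'),$$

writing $\partial\phi/\partial t = \dot\phi$ and $\partial\phi/\partial x = \phi'$.

Figure 3.2 The actual motion of the string between an initial displacement $\phi(x, t_1)$
and a final displacement $\phi(x, t_2)$ generates a surface in space-time.

Hamilton’s principle states that the action is stationary for that surface that
describes the actual motion of the string between its initial displacement $\phi(x, t_1)$
and its final displacement $\phi(x, t_2)$ (Fig. 3.2). We have

$$\delta S = \int_0^l dx \int_{t_1}^{t_2} dt \left[\frac{\partial\mathcal{L}}{\partial\dot\phi}\,\delta\dot\phi + \frac{\partial\mathcal{L}}{\partial\phi'}\,\delta\phi'\right].$$

Using $\delta(\dot\phi) = \partial(\delta\phi)/\partial t$ and $\delta(\phi') = \partial(\delta\phi)/\partial x$ we integrate each term by parts.
Again, the boundary contributions are zero since

$$\delta\phi(x, t_1) = \delta\phi(x, t_2) = 0 \ \text{ for all } x, \qquad \delta\phi(0, t) = \delta\phi(l, t) = 0 \ \text{ for all } t.$$

We are left with

$$\delta S = -\int_0^l dx \int_{t_1}^{t_2} dt \left[\frac{\partial}{\partial t}\left(\frac{\partial\mathcal{L}}{\partial\dot\phi}\right) + \frac{\partial}{\partial x}\left(\frac{\partial\mathcal{L}}{\partial\phi'}\right)\right]\delta\phi. \tag{3.11}$$

Since $\delta\phi(x, t)$ is arbitrary, the condition $\delta S = 0$ gives

$$\frac{\partial}{\partial t}\left(\frac{\partial\mathcal{L}}{\partial\dot\phi}\right) + \frac{\partial}{\partial x}\left(\frac{\partial\mathcal{L}}{\partial\phi'}\right) = 0. \tag{3.12}$$

Inserting the Lagrangian density (3.10), we obtain the familiar wave equation
for small amplitude waves on a string:

$$\rho\,\frac{\partial^2\phi}{\partial t^2} - F\,\frac{\partial^2\phi}{\partial x^2} = 0.$$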

Thus continuous systems can be described in a Lagrangian formalism by a suitable
choice of Lagrangian density, and clearly the method can be extended to waves
in any number of dimensions. By analogy with (3.6) and (3.7), we can define the
momentum density

$$\Pi(x) = \frac{\partial\mathcal{L}}{\partial\dot\phi}$$

and the Hamiltonian density

$$\mathcal{H} = \Pi\dot\phi - \mathcal{L}. \tag{3.13}$$

Since the Lagrangian density (3.10) does not depend explicitly on $t$, it follows that

$$E = \int \mathcal{H}\, dx = \int \left(\frac{\partial\mathcal{L}}{\partial\dot\phi}\,\dot\phi - \mathcal{L}\right) dx \tag{3.14}$$

remains constant during the motion (Problem 3.2). This result is the analogue of
(3.5).


3.4 A Lorentz covariant field theory 

In three spatial dimensions, the action is of the form

$$S = \int \mathcal{L}\, dx\, dy\, dz\, dt = \int \mathcal{L}\, dx^0\, dx^1\, dx^2\, dx^3. \tag{3.15}$$

The ‘volume element’ $dx^0\,dx^1\,dx^2\,dx^3 = d^4x$ is a Lorentz invariant (Section 2.4).
Hence $S$ is a Lorentz invariant if the Lagrangian density $\mathcal{L}$ transforms like a scalar
field. The covariance of the field equations is then assured. Other symmetries
required of a theory may be built into $\mathcal{L}$.

Consider a Lorentz invariant Lagrangian density of the form

$$\mathcal{L} = \mathcal{L}(\phi, \partial_\mu\phi), \tag{3.16}$$

where $\phi(x) = \phi(x^0, \mathbf{x})$ is a scalar field. At any point $x$ in space-time, such a
Lagrangian density depends only on the field and its first derivatives at that point.
The field theory is said to be local: there is no ‘action at a distance’. This will be an
important feature of the Standard Model. The field equation is easily derived from
the condition $\delta S = 0$, together with the condition that the field vanishes at large
distances, and we find

$$\frac{\partial\mathcal{L}}{\partial\phi} - \partial_\mu\left(\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\right) = 0. \tag{3.17}$$

3.5 The Klein-Gordon equation 

The Lorentz invariant Lagrangian density 

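$$\mathcal{L} = \tfrac{1}{2}\left[g^{\mu\nu}\partial_\mu\phi\,\partial_\nu\phi - m^2\phi^2\right] = \tfrac{1}{2}\left[\partial_\mu\phi\,\partial^\mu\phi - m^2\phi^2\right], \tag{3.18}$$

where $\phi(x)$ is a real scalar field, is a particular case of (3.16). The field equation
(3.17) becomes

$$-\partial_\mu\partial^\mu\phi - m^2\phi = 0,$$

or

$$\left(-\frac{\partial^2}{\partial t^2} + \nabla^2\right)\phi - m^2\phi = 0.$$

This equation is known as the Klein-Gordon equation.
The equation has wave-like solutions

$$\phi(\mathbf{r}, t) = a\cos(\mathbf{k}\cdot\mathbf{r} - \omega_k t + \theta_k) \tag{3.19}$$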


where the frequency $\omega_k$ is related to the wave vector $\mathbf{k}$ by the dispersion relation

$$\omega_k^2 = \mathbf{k}^2 + m^2, \tag{3.20}$$

and $\theta_k$ is an arbitrary phase angle.

For mathematical simplicity we shall take the solutions $\phi(\mathbf{r}, t)$ to lie in a large
cube of side $l$, volume $V = l^3$, and apply periodic boundary conditions, so that
$\mathbf{k} = (2\pi n_1/l,\ 2\pi n_2/l,\ 2\pi n_3/l)$ where $n_1, n_2, n_3$ are any integers $0, \pm 1, \pm 2, \ldots$
The general solution of the Klein-Gordon equation is a superposition of such plane waves:

$$\phi(\mathbf{r}, t) = \frac{1}{\sqrt{V}}\sum_{\mathbf{k}}\left(\frac{a_{\mathbf{k}}}{\sqrt{2\omega_k}}\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)} + \frac{a_{\mathbf{k}}^*}{\sqrt{2\omega_k}}\,e^{-i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)}\right). \tag{3.21}$$

The factors $\sqrt{2\omega_k}$ are introduced for later convenience, and the phase factors have
been absorbed into the complex wave amplitudes $a_{\mathbf{k}}$. The sum is over all allowed
values of $\mathbf{k}$.

With the de Broglie identifications $E = \omega_k$, $\mathbf{p} = \mathbf{k}$ (recall $\hbar = 1$, $c = 1$) the
dispersion relation for $\omega_k$ is equivalent to the Einstein equation for a free particle,

$$E^2 = \mathbf{p}^2 + m^2.$$


We may conjecture that the Klein-Gordon equation for $\phi$ describes a scalar
particle of mass $m$. There is no vector associated with a one-component scalar field,
and the intrinsic angular momentum associated with such a particle is zero.

We shall see a Lagrangian density of the form (3.18) arising in the Standard Model
to describe the Higgs particle. At a less fundamental level, the overall motion of
the $\pi^0$ meson, which is an uncharged composite particle, is described by a similar
Lagrangian density.
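
The role of the dispersion relation (3.20) can be illustrated numerically. The sketch below assumes a plane wave in one space dimension and evaluates $\partial^2\phi/\partial t^2 - \partial^2\phi/\partial x^2 + m^2\phi$ by central differences; the function name, step size and parameter values are illustrative assumptions.

```python
import math

def kg_residual(x, t, k=1.3, m=0.7, omega=None, a=1.0, theta=0.4, h=1e-3):
    """Finite-difference value of phi_tt - phi_xx + m^2 phi for a 1D plane wave.

    If omega is None, the dispersion relation omega^2 = k^2 + m^2 is used.
    """
    if omega is None:
        omega = math.sqrt(k * k + m * m)

    def phi(x_, t_):
        return a * math.cos(k * x_ - omega * t_ + theta)

    phi_tt = (phi(x, t + h) - 2 * phi(x, t) + phi(x, t - h)) / h ** 2
    phi_xx = (phi(x + h, t) - 2 * phi(x, t) + phi(x - h, t)) / h ** 2
    return phi_tt - phi_xx + m * m * phi(x, t)

# Residual with the correct frequency, and with the massless choice omega = |k|.
print(kg_residual(0.2, 0.5))
print(kg_residual(0.2, 0.5, omega=1.3))
```

With the frequency fixed by (3.20) the residual is zero up to discretisation error; any other frequency leaves a residual of order $m^2\phi$.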


3.6 The energy-momentum tensor 

The equations expressing both conservation of energy and conservation of linear 
momentum are obtained by considering the change in £ corresponding to a uniform 
infinitesimal space-time displacement 

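$$x^\mu \to x^\mu + \delta a^\mu, \tag{3.22}$$

where $\delta a^\mu$ does not depend on $x$. The corresponding change in $\phi$ is

$$\delta\phi = (\partial_\nu\phi)\,\delta a^\nu. \tag{3.23}$$

Since $\mathcal{L}$ does not depend explicitly on the $x^\mu$,

$$\delta\mathcal{L} = \frac{\partial\mathcal{L}}{\partial\phi}\,\delta\phi + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\delta(\partial_\mu\phi).$$

Using the field equation (3.17) for $\partial\mathcal{L}/\partial\phi$, and the fact that $\delta(\partial_\mu\phi) = \partial_\mu(\delta\phi)$, we
can rewrite this as

$$\delta\mathcal{L} = \partial_\mu\left[\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\delta\phi\right]$$

and then, from (3.23),

$$\delta\mathcal{L} = \partial_\mu\left[\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\partial_\nu\phi\right]\delta a^\nu.$$

We have also

$$\delta\mathcal{L} = \frac{\partial\mathcal{L}}{\partial x^\mu}\,\delta a^\mu = \delta^\mu_\nu\,\frac{\partial\mathcal{L}}{\partial x^\mu}\,\delta a^\nu,$$

where, as in (2.14),

$$\delta^\mu_\nu = \begin{cases} 1, & \mu = \nu \\ 0, & \mu \neq \nu. \end{cases}$$

Since the $\delta a^\nu$ are arbitrary, it follows on comparing these expressions for $\delta\mathcal{L}$ that

$$\partial_\mu\left[\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\partial_\nu\phi - \delta^\mu_\nu\,\mathcal{L}\right] = 0, \tag{3.24}$$

or

$$\partial_\mu T^\mu_\nu = 0, \quad \text{where} \quad T^\mu_\nu = \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\partial_\nu\phi - \delta^\mu_\nu\,\mathcal{L}. \tag{3.25}$$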


$T^\mu_\nu$ is the energy-momentum tensor. The component

$$T^0_0 = \frac{\partial\mathcal{L}}{\partial\dot\phi}\,\dot\phi - \mathcal{L}$$

corresponds to the Hamiltonian density defined in equation (3.13), and is interpreted
as the energy density of the field; in a relativistic theory, the energy density
transforms like a component of a tensor. The $\nu = 0$ component of (3.25) may be
written

$$\frac{\partial}{\partial t}\left(T^0_0\right) + \nabla\cdot\mathbf{T}_0 = 0, \tag{3.26}$$

and expresses local conservation of energy, with $\mathbf{T}_0 = (T^1_0, T^2_0, T^3_0)$ interpreted as
the energy flux. Integrating (3.26) over all space and using the divergence theorem
yields

$$\frac{d}{dt}\int T^0_0\,d^3x = 0, \tag{3.27}$$


provided the field vanishes at large distances. This equation expresses the overall 
conservation of energy. 

Similarly the $\nu = 1, 2, 3$ components of (3.24) correspond to local conservation
of momentum, with the overall total momentum of the field given by

$$P_i = \int T^0_i\,d^3x. \tag{3.28}$$


As with the energy, the total momentum of the field is conserved if the field vanishes 
at large distances. 

In the case of the Klein-Gordon Lagrangian density (3.18),

$$\frac{\partial\mathcal{L}}{\partial\dot\phi} = \dot\phi,$$

and the energy density of the field is

$$T^0_0 = \tfrac{1}{2}\left[\dot\phi^2 + (\nabla\phi)^2 + m^2\phi^2\right]. \tag{3.29}$$


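Expressing $\phi$ in terms of the field amplitudes $a_{\mathbf{k}}^*$ and $a_{\mathbf{k}}$, and integrating over all
space, gives the total field energy

$$H = \int T^0_0\,d^3x = \sum_{\mathbf{k}} a_{\mathbf{k}}^* a_{\mathbf{k}}\,\omega_k. \tag{3.30}$$

In obtaining this expression we have used the orthogonality of the plane waves

$$\frac{1}{V}\int e^{i(\mathbf{k} - \mathbf{k}')\cdot\mathbf{r}}\,d^3x = \delta_{\mathbf{k}\mathbf{k}'}.$$

Similarly from (3.28) the total momentum of the field can be shown to be

$$\mathbf{P} = \sum_{\mathbf{k}} a_{\mathbf{k}}^* a_{\mathbf{k}}\,\mathbf{k}. \tag{3.31}$$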
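
Local energy conservation (3.26) can be checked numerically for a Klein-Gordon field. The sketch below is restricted to one space dimension, an assumption made for brevity. It uses the energy density (3.29) and the energy flux $T^1_0 = -\dot\phi\,\partial\phi/\partial x$, which follows from (3.25) with the Lagrangian density (3.18). The mode list and the mass value are arbitrary.

```python
import math

M = 0.8                                            # mass parameter (arbitrary)
MODES = [(1.0, 0.5, 0.3), (0.6, -1.7, 1.1)]        # (amplitude, wave number, phase)

def field(x, t):
    """Return (phi, dphi/dt, dphi/dx) for a sum of 1D Klein-Gordon plane waves."""
    phi = phi_t = phi_x = 0.0
    for a, k, theta in MODES:
        omega = math.sqrt(k * k + M * M)           # dispersion relation (3.20)
        arg = k * x - omega * t + theta
        phi += a * math.cos(arg)
        phi_t += a * omega * math.sin(arg)
        phi_x += -a * k * math.sin(arg)
    return phi, phi_t, phi_x

def energy_density(x, t):
    phi, phi_t, phi_x = field(x, t)
    return 0.5 * (phi_t ** 2 + phi_x ** 2 + M * M * phi ** 2)

def energy_flux(x, t):
    _, phi_t, phi_x = field(x, t)
    return -phi_t * phi_x

def continuity_residual(x, t, h=1e-4):
    """Central-difference estimate of d(T^0_0)/dt + d(T^1_0)/dx."""
    d_t = (energy_density(x, t + h) - energy_density(x, t - h)) / (2 * h)
    d_x = (energy_flux(x + h, t) - energy_flux(x - h, t)) / (2 * h)
    return d_t + d_x

print(continuity_residual(0.4, 1.2))
```

The residual vanishes up to discretisation error even though the energy density itself varies in time for a superposition of modes.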


3.7 Complex scalar fields 

It is instructive to consider also complex scalar fields $\Phi = (\phi_1 + i\phi_2)/\sqrt{2}$ satisfying
the Klein-Gordon equation. We shall see in Section 7.6 that if the field $\Phi$ carries
charge $q$, then the field $\Phi^*$ carries charge $-q$. The Klein-Gordon equation for a
complex field $\Phi$ is obtained from the (real) Lagrangian density

$$\mathcal{L} = \partial_\mu\Phi^*\,\partial^\mu\Phi - m^2\Phi^*\Phi. \tag{3.32}$$

We introduce here a device that we shall often find useful. Instead of varying the
real and imaginary parts of $\Phi$ to obtain the field equations, we may vary $\Phi$ and
its complex conjugate $\Phi^*$ independently. These procedures are equivalent. Varying
$\Phi^*$ in the action constructed from (3.32) yields, easily,

$$-\partial_\mu\partial^\mu\Phi - m^2\Phi = 0. \tag{3.33}$$

(Varying $\Phi$ gives the complex conjugate of this equation.)

Note that the Lagrangian density (3.32) is the sum of contributions from the
scalar fields $\phi_1$ and $\phi_2$:

$$\mathcal{L} = \tfrac{1}{2}\left[\partial_\mu\phi_1\,\partial^\mu\phi_1 - m^2\phi_1^2\right] + \tfrac{1}{2}\left[\partial_\mu\phi_2\,\partial^\mu\phi_2 - m^2\phi_2^2\right]. \tag{3.34}$$

The general solution of (3.33) is a superposition of plane waves of the form

$$\Phi = \frac{1}{\sqrt{V}}\sum_{\mathbf{k}}\left(\frac{a_{\mathbf{k}}}{\sqrt{2\omega_k}}\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)} + \frac{b_{\mathbf{k}}^*}{\sqrt{2\omega_k}}\,e^{-i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)}\right), \tag{3.35}$$

where $a_{\mathbf{k}}$ and $b_{\mathbf{k}}$ are now independent complex numbers. The field energy becomes

$$H = \sum_{\mathbf{k}}\left(a_{\mathbf{k}}^* a_{\mathbf{k}} + b_{\mathbf{k}}^* b_{\mathbf{k}}\right)\omega_k. \tag{3.36}$$

We shall see that we can interpret this expression as being made up of the distinct
contributions of positively and negatively charged fields. (The $\pi^+$ and $\pi^-$ mesons
are composite particles whose overall motion is described by complex scalar fields.)


Problems 


3.1 Show that the kinetic energy of a system of particles, whose positions are determined
by $q(t)$, is a quadratic function of the $\dot q_i$.

3.2 Show that $dE/dt = 0$, where $E$ is given by equation (3.14).

3.3 For the stretched string of Section 3.3, show that the Hamiltonian density is

$$\mathcal{H} = \frac{1}{2}\rho\left(\frac{\partial\phi}{\partial t}\right)^2 + \frac{1}{2}F\left(\frac{\partial\phi}{\partial x}\right)^2.$$

The $n$th normal mode of oscillation, with wave amplitude $A_n$, is given by

$$\phi_n(x, t) = A_n\sin(k_n x)\sin(\omega_n t)$$

where $k_n = n\pi/l$, $\omega_n = (F/\rho)^{1/2}k_n$. Show that the total energy is $A_n^2\omega_n^2\rho l/4$ and
oscillates harmonically between potential energy and kinetic energy.

3.4 Verify the expressions (3.30) and (3.31) for the energy and momentum of the scalar 
field given by equation (3.21). 

3.5 Show that the Schrödinger equation for the wave function $\psi(\mathbf{r}, t)$ of a particle of mass
$m$ moving in a potential $V(\mathbf{r})$ may be obtained from the Lagrangian density

$$\mathcal{L} = -\frac{1}{2i}\left(\psi^*\frac{\partial\psi}{\partial t} - \frac{\partial\psi^*}{\partial t}\psi\right) - \frac{1}{2m}\nabla\psi^*\cdot\nabla\psi - \psi^* V\psi.$$

(Note that $\mathcal{L}$ is real, but not Lorentz invariant.)


Classical electromagnetism


Maxwell’s theory of electromagnetism is, along with Einstein’s theory of gravitation,
one of the most beautiful of classical field theories. In this chapter we
exhibit the Lorentz covariance of Maxwell’s equations and show how they may be 
obtained from Hamilton’s principle. The important idea of a gauge transformation 
is introduced, and related to the conservation of electric charge. We analyse some 
properties of solutions of the field equations. Finally, we generalise the Lagrangian 
to describe massive vector fields, which will figure in later chapters. 


4.1 Maxwell’s equations 


In common with much of the literature, we shall use units in which the force between
charges $q_1$ and $q_2$ is $q_1q_2/4\pi r^2$, and the velocity of light $c = 1$. (Thus in these units
$\mu_0 = 1$, $\epsilon_0 = 1$.) Maxwell’s equations then take the form

$$\begin{aligned}
\nabla\cdot\mathbf{E} &= \rho \quad \text{(a)}, & \nabla\times\mathbf{B} - \frac{\partial\mathbf{E}}{\partial t} &= \mathbf{J} \quad \text{(b)}, \\
\nabla\cdot\mathbf{B} &= 0 \quad \text{(c)}, & \nabla\times\mathbf{E} + \frac{\partial\mathbf{B}}{\partial t} &= 0 \quad \text{(d)}.
\end{aligned} \tag{4.1}$$

$\mathbf{E}$ and $\mathbf{B}$ are the electric and magnetic fields, $\rho$ and $\mathbf{J}$ are the electric charge and
current densities. In this chapter we do not consider the dynamics of $\rho$ and $\mathbf{J}$, but
take them to be ‘external’ fields that we are free to manipulate. The inhomogeneous
equations (a) and (b) are consistent with the observed fact of charge conservation,
which is expressed by the continuity equation:

$$\frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{J} = 0.$$

This equation takes the Lorentz invariant form

$$\partial_\mu J^\mu = 0 \tag{4.2}$$

if we postulate that the charge-current densities

$$J^\mu = (\rho, \mathbf{J}) \tag{4.3}$$

make up a contravariant four-vector field.

Introducing a scalar potential $\phi$ and a vector potential $\mathbf{A}$, the homogeneous
equations (c) and (d) of the set (4.1) are satisfied identically by

$$\mathbf{B} = \nabla\times\mathbf{A}, \qquad \mathbf{E} = -\nabla\phi - \frac{\partial\mathbf{A}}{\partial t}. \tag{4.4}$$

We postulate that the potentials

$$A^\mu = (\phi, \mathbf{A}) \tag{4.5}$$

make up a contravariant four-vector field also.

Maxwell’s equations may be written in terms of the antisymmetric tensor $F^{\mu\nu}$,
defined by

$$F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu = \begin{pmatrix} 0 & -E_x & -E_y & -E_z \\ E_x & 0 & -B_z & B_y \\ E_y & B_z & 0 & -B_x \\ E_z & -B_y & B_x & 0 \end{pmatrix}. \tag{4.6}$$

It is apparent that the electromagnetic field is a tensor field. For example,

$$F^{01} = \partial A^1/\partial x_0 - \partial A^0/\partial x_1 = \partial A_x/\partial t + \partial\phi/\partial x = -E_x.$$

Thus the components of the electromagnetic field transform under a Lorentz
transformation like the elements of a tensor.

The homogeneous Maxwell equations correspond to the identities

$$\partial^\lambda F^{\mu\nu} + \partial^\nu F^{\lambda\mu} + \partial^\mu F^{\nu\lambda} = 0, \tag{4.7}$$

where $\lambda, \mu, \nu$ are any three of 0, 1, 2, 3, as the reader may easily verify. The
inhomogeneous equations take the manifestly covariant form

$$\partial_\mu F^{\mu\nu} = J^\nu. \tag{4.8}$$

For example, with $\nu = 0$, looking at the first column of $F^{\mu\nu}$, and noting $\partial_\mu = (\partial/\partial t, \nabla)$, gives

$$\nabla\cdot\mathbf{E} = \rho.$$


4.2 A Lagrangian density for electromagnetism 

We now seek a Lagrangian density $\mathcal{L}$ that will yield Maxwell’s equations from
Hamilton’s principle. If $\mathcal{L}$ is Lorentz invariant, the action

$$S = \int \mathcal{L}\,d^4x = \int \mathcal{L}\,dx^0\,dx^1\,dx^2\,dx^3 \tag{4.9}$$

is also Lorentz invariant, since $d^4x$ is invariant (Section 2.4 and Section 3.4), and
the field equations which follow from the condition $\delta S = 0$ will take the same form
in every inertial frame of reference.

Although Maxwell’s equations do not refer explicitly to the potentials $A^\mu$, to
derive the equations from Hamilton’s principle requires the potentials to be taken
as the basic fields which are to be varied. The ‘stretched string’ example of Section
3.3 suggests that $\mathcal{L}$ should be quadratic in the first derivatives of the field. A suitable
Lorentz invariant choice is found to be

$$\mathcal{L} = -\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} - J^\mu A_\mu. \tag{4.10}$$

Varying the fields $A^\mu$, while keeping the charge and current densities $J^\mu$ fixed, yields
Maxwell’s equations, as we shall show in some detail. (Subsequent arguments will
be more terse!)

We may write 


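$$S = \int\left[-\tfrac{1}{4}\,g_{\mu\lambda}g_{\nu\rho}F^{\lambda\rho}F^{\mu\nu} - J^\mu A_\mu\right]d^4x. \tag{4.11}$$

Then

$$\begin{aligned}
\delta S &= \int\left[-\tfrac{1}{2}\,g_{\mu\lambda}g_{\nu\rho}F^{\lambda\rho}\,\delta F^{\mu\nu} - J^\mu\,\delta A_\mu\right]d^4x \\
&= \int\left[-\tfrac{1}{2}F^{\lambda\rho}\left(\partial_\lambda\delta A_\rho - \partial_\rho\delta A_\lambda\right) - J^\rho\,\delta A_\rho\right]d^4x \\
&= \int\left[-F^{\lambda\rho}\,\partial_\lambda\delta A_\rho - J^\rho\,\delta A_\rho\right]d^4x, \quad \text{since } F^{\lambda\rho} = -F^{\rho\lambda}.
\end{aligned}$$

The first term we integrate by parts. The boundary terms vanish for suitable conditions
on the fields, so that we are left with

$$\delta S = \int\left[\partial_\lambda F^{\lambda\rho} - J^\rho\right]\delta A_\rho\,d^4x.$$

Setting $\delta S = 0$ for arbitrary $\delta A_\rho$ gives the inhomogeneous Maxwell equations (4.8).
(The homogeneous equations (4.7) are no more than identities.)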
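
As a small numerical illustration, the free-field part of (4.10) can be evaluated directly from the tensor (4.6). The sketch below assumes the metric $g_{\mu\nu} = \mathrm{diag}(1, -1, -1, -1)$ and returns $-\frac{1}{4}F_{\mu\nu}F^{\mu\nu}$, which should equal $\frac{1}{2}(\mathbf{E}^2 - \mathbf{B}^2)$ as in Problem 4.1 with the sources set to zero. The function names are illustrative only.

```python
METRIC = [1.0, -1.0, -1.0, -1.0]   # diagonal of g_{mu nu} (assumed signature)

def field_tensor(E, B):
    """Contravariant F^{mu nu} laid out as in equation (4.6)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]

def free_lagrangian(E, B):
    """-(1/4) F_{mu nu} F^{mu nu}, lowering both indices with the diagonal metric."""
    F = field_tensor(E, B)
    total = 0.0
    for mu in range(4):
        for nu in range(4):
            total += METRIC[mu] * METRIC[nu] * F[mu][nu] ** 2
    return -0.25 * total

E = (1.0, 2.0, 3.0)
B = (0.5, -1.0, 2.0)
half_e2_minus_b2 = 0.5 * (sum(e * e for e in E) - sum(b * b for b in B))
print(free_lagrangian(E, B), half_e2_minus_b2)
```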


4.3 Gauge transformations 

The four-potential $A^\mu = (\phi, \mathbf{A})$ is not unique: the same electromagnetic field tensor
$F^{\mu\nu}$ is obtained from the potential

$$A^\mu + \partial^\mu\chi = (\phi + \partial\chi/\partial t,\ \mathbf{A} - \nabla\chi), \tag{4.12}$$

where $\chi(x)$ is an arbitrary scalar field, since the additional terms which appear in
$F^{\mu\nu}$ are identically zero:

$$\partial^\mu\partial^\nu\chi - \partial^\nu\partial^\mu\chi = 0.$$

The transformation $A^\mu \to A'^\mu = A^\mu + \partial^\mu\chi$ is called a gauge transformation.

Under a gauge transformation, the action (4.11) acquires an additional term $\Delta S$,
where

$$\begin{aligned}
\Delta S &= -\int J^\mu\,\partial_\mu\chi\,d^4x \\
&= \int\left(\partial_\mu J^\mu\right)\chi\,d^4x.
\end{aligned}$$

We have integrated by parts to obtain the second line and again assumed that the
boundary terms vanish. $\Delta S$ is zero for arbitrary $\chi$ if, and only if,

$$\partial_\mu J^\mu = \partial^\mu J_\mu = 0,$$

which is just equation (4.2). Thus the gauge invariance of the action requires, and
follows from, the conservation of electric charge.
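
The invariance of $\mathbf{E}$ and $\mathbf{B}$ under (4.12) can be illustrated with a short numerical experiment. The sketch below evaluates (4.4) by central differences for a set of potentials and for their gauge transforms. The particular potentials, the gauge function `chi`, the step `H` and the sample point are hypothetical choices made only for this example.

```python
import math

H = 1e-3  # finite-difference step (arbitrary choice for this sketch)

def partial(f, point, axis):
    """Central difference of f(t, x, y, z) along coordinate `axis` (0 = t)."""
    up = list(point)
    down = list(point)
    up[axis] += H
    down[axis] -= H
    return (f(*up) - f(*down)) / (2 * H)

def fields(phi, A, point):
    """E = -grad(phi) - dA/dt and B = curl(A), as in equation (4.4)."""
    E = [-partial(phi, point, i + 1) - partial(A[i], point, 0) for i in range(3)]
    B = [
        partial(A[2], point, 2) - partial(A[1], point, 3),
        partial(A[0], point, 3) - partial(A[2], point, 1),
        partial(A[1], point, 1) - partial(A[0], point, 2),
    ]
    return E, B

def gauge_transform(phi, A, chi):
    """Potentials of equation (4.12): phi + dchi/dt and A - grad(chi)."""
    new_phi = lambda *p: phi(*p) + partial(chi, p, 0)
    new_A = tuple(
        (lambda *p, i=i: A[i](*p) - partial(chi, p, i + 1)) for i in range(3)
    )
    return new_phi, new_A

# Hypothetical potentials and gauge function, chosen only for illustration.
def phi(t, x, y, z):
    return math.sin(x) * math.cos(t) + y * z

A = (
    lambda t, x, y, z: y * t,
    lambda t, x, y, z: math.cos(z + t),
    lambda t, x, y, z: x * x * y,
)

def chi(t, x, y, z):
    return math.sin(t * x) + y * z * z

POINT = (0.3, 0.1, -0.4, 0.7)
E_old, B_old = fields(phi, A, POINT)
new_phi, new_A = gauge_transform(phi, A, chi)
E_new, B_new = fields(new_phi, new_A, POINT)
max_change = max(abs(a - b) for a, b in zip(E_old + B_old, E_new + B_new))
print(max_change)
```

The potentials themselves change by a finite amount at the sample point, while the fields agree to within rounding error.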


4.4 Solutions of Maxwell’s equations 

In terms of the potentials, the field equations (4.8) are

$$(\partial_\mu\partial^\mu)A^\nu - \partial^\nu(\partial_\mu A^\mu) = J^\nu. \tag{4.13}$$


We stress again that there is much arbitrariness in the solutions to these equations. 
Equivalent solutions differ by gauge transformations. It is usual to impose a
gauge-fixing condition. For example in the ‘radiation gauge’ we set $\nabla\cdot\mathbf{A} = 0$, everywhere
and at all times (Problem 4.2). This has the disadvantage of not being a Lorentz
invariant condition - it will not be true in another, moving, frame - but it does
display important features of the theory. In the radiation gauge the field equation
for $A^0$ becomes

$$(\partial_i\partial^i)A^0 = -\nabla^2 A^0 = J^0$$

(setting $\nu = 0$ in (4.13), and noting $\partial_\mu A^\mu = \partial_0 A^0$ since in the radiation gauge
$\partial_i A^i = 0$). This equation has the solution

$$A^0(\mathbf{r}, t) = \frac{1}{4\pi}\int\frac{\rho(\mathbf{r}', t)}{|\mathbf{r} - \mathbf{r}'|}\,d^3r'.$$

Hence, in the radiation gauge, $A^0$ is determined entirely by the charge density to
which it is rigidly attached! There are no wave-like solutions. The vector components
$A^i$ ($i = 1, 2, 3$) satisfy the inhomogeneous wave equation

$$\frac{\partial^2\mathbf{A}}{\partial t^2} - \nabla^2\mathbf{A} = \mathbf{J} - \nabla\frac{\partial A^0}{\partial t}. \tag{4.14}$$

Charges and currents act as a source (and sink) of the field $\mathbf{A}$.

In free space $\mathbf{J} = 0$, $\rho = 0$, $A^0 = 0$, and there are plane wave solutions with
wave vector $\mathbf{k}$, frequency $\omega_k = |\mathbf{k}|$, of the form

$$\mathbf{A}(\mathbf{r}, t) = a\,\boldsymbol{\varepsilon}\cos(\mathbf{k}\cdot\mathbf{r} - \omega_k t).$$

Here $\boldsymbol{\varepsilon}$ is a unit vector and $a$ is the wave amplitude. The gauge condition requires
$\mathbf{k}\cdot\boldsymbol{\varepsilon} = 0$. Thus for a given $\mathbf{k}$ there are only two independent states of polarisation,
$\boldsymbol{\varepsilon}_1(\mathbf{k})$ and $\boldsymbol{\varepsilon}_2(\mathbf{k})$ say, perpendicular to $\mathbf{k}$. The general solution in free space is

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{\sqrt{V}}\sum_{\mathbf{k}}\sum_{\alpha=1,2}\frac{\boldsymbol{\varepsilon}_\alpha(\mathbf{k})}{\sqrt{2\omega_k}}\left[a_{\mathbf{k}\alpha}\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)} + a_{\mathbf{k}\alpha}^*\,e^{-i(\mathbf{k}\cdot\mathbf{r} - \omega_k t)}\right]. \tag{4.15}$$

The complex number $a_{\mathbf{k}\alpha}$ represents an amplitude and a phase, and the plane waves
are normalised in a volume $V$, with periodic boundary conditions. The factor $\sqrt{2\omega_k}$
is put in for convenience later.

An important point apparent in the radiation gauge is that although the vector
potential has four components $A^\mu$, one of these, $A^0$, has no independent dynamics
and another is a gauge artifact, which is eliminated by fixing the gauge. There are
only two physically significant dynamical fields.
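
The counting of polarisation states can be made concrete by constructing, for a given wave vector, two orthonormal vectors that satisfy the radiation-gauge condition $\mathbf{k}\cdot\boldsymbol{\varepsilon} = 0$. The construction below, based on cross products with a helper axis, is one possible choice among many and is not taken from the text; the function names are illustrative.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return [
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    ]

def normalise(v):
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def polarisation_vectors(k):
    """Return two orthonormal vectors perpendicular to the non-zero wave vector k."""
    # Use the coordinate axis least aligned with k as a helper direction.
    axis = min(range(3), key=lambda i: abs(k[i]))
    helper = [0.0, 0.0, 0.0]
    helper[axis] = 1.0
    e1 = normalise(cross(k, helper))
    e2 = normalise(cross(k, e1))
    return e1, e2

k = [0.3, -1.2, 2.0]
e1, e2 = polarisation_vectors(k)
print(dot(k, e1), dot(k, e2), dot(e1, e2))
```

Any rotation of this pair about $\mathbf{k}$ gives an equally valid basis; only the two-dimensional transverse plane is fixed by the gauge condition.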

The fields in any other gauge are related to the fields in the radiation gauge by a 
gauge transformation; the physics is the same but the mathematics is different. For 
some purposes it is better to work in the relativistically invariant ‘Lorentz gauge’. 
In the Lorentz gauge

$$\partial_\mu A^\mu = 0 \tag{4.16}$$

and the field equations become

$$\left(\frac{\partial^2}{\partial t^2} - \nabla^2\right)A^\mu = J^\mu. \tag{4.17}$$

4.5 Space inversion 

We now consider the operation of space inversion of the coordinate axes in the
origin: $\mathbf{r} \to \mathbf{r}' = -\mathbf{r}$, $\nabla \to \nabla' = -\nabla$ (Fig. 4.1), which was excluded from the
group of proper Lorentz transformations. We shall also refer to this as the parity
operation. The transformed coordinate axes are left-handed. By convention the
charge density is taken to be invariant under this transformation: if at some instant
of time $\rho^P(\mathbf{r}')$ is the charge density referred to the inverted coordinate axes, then
$\rho^P(\mathbf{r}') = \rho(\mathbf{r})$ when $\mathbf{r}' = -\mathbf{r}$. The current density $\mathbf{J}(\mathbf{r}) = \rho(\mathbf{r})\,\mathbf{u}(\mathbf{r})$, where $\mathbf{u}(\mathbf{r})$ is a
velocity, and therefore transforms like $d\mathbf{r}/dt$, an ordinary vector: $\mathbf{J}^P(\mathbf{r}') = -\mathbf{J}(\mathbf{r})$.
Maxwell’s equations (4.1) retain the same form in the primed coordinate system
if $\mathbf{E}(\mathbf{r})$ also transforms like a vector, $\mathbf{E}^P(\mathbf{r}') = -\mathbf{E}(\mathbf{r})$, and $\mathbf{B}(\mathbf{r})$ transforms like an
axial vector, $\mathbf{B}^P(\mathbf{r}') = \mathbf{B}(\mathbf{r})$.

Figure 4.1 A normal right-handed set of axes (solid lines) and a space-inverted set
(dashed lines). The space-inverted set is said to be left-handed. ($Oz$ is out of the
plane of the page.)

In terms of the potentials, equation (4.4) shows that we must take

$$\phi^P(\mathbf{r}') = \phi(\mathbf{r}), \qquad \mathbf{A}^P(\mathbf{r}') = -\mathbf{A}(\mathbf{r}). \tag{4.18}$$

The field equations in a left-handed frame then have the same form as in a
right-handed frame. The Lagrangian density (4.10) is invariant under space inversion.
Electromagnetism is indifferent to handedness.


4.6 Charge conjugation


It will also be of interest to note that Maxwell’s equations can be made to take the 
same form if matter is replaced by antimatter. As a consequence of this replacement 
both the charge and current densities change sign so that 

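$$\rho(\mathbf{r}) \to \rho^C(\mathbf{r}) = -\rho(\mathbf{r}) \quad \text{and} \quad \mathbf{J}(\mathbf{r}) \to \mathbf{J}^C(\mathbf{r}) = -\mathbf{J}(\mathbf{r}).$$

Maxwell’s equations take the same form if we define

$$\phi^C(\mathbf{r}) = -\phi(\mathbf{r}), \qquad \mathbf{A}^C(\mathbf{r}) = -\mathbf{A}(\mathbf{r}). \tag{4.19}$$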

This operation is called charge conjugation. As with Lorentz transformations and 
the parity transformation, the Lagrangian is invariant under the charge conjugation 
transformation. 


4.7 Intrinsic angular momentum of the photon 

Without embarking here on the full quantisation of the electromagnetic field, we 
can discuss the quantised intrinsic angular momentum, or spin, of the photons 
associated with plane waves of the form (4.15). 

The spin $\mathbf{S}$ of a particle with mass is defined as its angular momentum in a frame of
reference in which it is at rest. In such a frame its orbital angular momentum $\mathbf{L} = 0$,
and its total angular momentum $\mathbf{J} = \mathbf{L} + \mathbf{S} = \mathbf{S}$. This definition is inapplicable to
a massless particle, which moves with the velocity of light in every frame of reference.
However, for a massless particle moving in, say, the $z$-direction, it is possible
to define the $z$-component $S_z$ of its spin, since the $z$-component of the orbital angular
momentum is $L_z = xp_y - yp_x$, and $p_x = p_y = 0$ for a particle moving in the
$z$-direction, hence $L_z = 0$, and $J_z = S_z$.

In quantum mechanics, the component J z of the total angular momentum operator 
of a system is given by 


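$$J_z = i\hbar\,r_z = i\hbar\lim_{\phi\to 0}\,[R_z(\phi) - 1]/\phi, \tag{4.20}$$

where $R_z(\phi)$ is the operator that rotates the system through an angle $\phi$ about $Oz$ in
a positive sense.

Consider a term from (4.15) with $\mathbf{k} = (0, 0, k)$ along $Oz$:

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{\sqrt{2\omega V}}\left[(a_1\boldsymbol{\varepsilon}_x + a_2\boldsymbol{\varepsilon}_y)\,e^{i(kz - \omega t)} + \text{complex conjugate}\right]. \tag{4.21}$$

The wave amplitudes $a_1$ and $a_2$ are complex numbers, and we have taken the
polarisation vectors $\boldsymbol{\varepsilon}_x$ and $\boldsymbol{\varepsilon}_y$ to be unit vectors aligned with the $x$- and $y$-axes. A
rotation of $\mathbf{A}$ through an angle $\phi$ about $Oz$ makes a change in the amplitudes that
can be expressed by the rotation matrix equation

$$R_z(\phi)\begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = \begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}\begin{pmatrix} a_1 \\ a_2 \end{pmatrix}.$$

In the limit $\phi \to 0$, we have

$$\lim_{\phi\to 0}\,[R_z(\phi) - 1]/\phi = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$$

and

$$J_z = \hbar\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}.$$

The eigenvectors of $J_z/\hbar$ are

$$\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix} \quad \text{with eigenvalue } +1,$$

$$\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -i \end{pmatrix} \quad \text{with eigenvalue } -1.$$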

Thus we may say that a photon represented by the plane wave (4.21) has ‘spin 
one’, with just two spin states aligned and anti-aligned with its direction of motion. 
No meaning can be given to spin components perpendicular to the direction of 
motion. Classically these waves are right circularly polarised and left circularly 
polarised, respectively (Problem 4.4). 

A plane wave of any polarisation can be constructed by a suitable superposition 
of right-handed and left-handed circularly polarised waves. 
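
The limit in (4.20) can be approximated numerically. The sketch below forms $i[R_z(\phi) - 1]/\phi$ for a small but finite angle, acting on the amplitude pair $(a_1, a_2)$, and applies it to the circular-polarisation combinations $(1, \pm i)/\sqrt{2}$. The small angle `1e-6` and the function names are assumptions made for the illustration.

```python
import math

def rotation(phi):
    """Rotation matrix acting on the amplitudes (a1, a2) for an angle phi about Oz."""
    return [[math.cos(phi), -math.sin(phi)],
            [math.sin(phi), math.cos(phi)]]

def jz_over_hbar(phi=1e-6):
    """Approximate i [R_z(phi) - 1] / phi using a small finite angle."""
    R = rotation(phi)
    return [[1j * (R[r][c] - (1.0 if r == c else 0.0)) / phi for c in range(2)]
            for r in range(2)]

def apply(matrix, vec):
    return [sum(matrix[r][c] * vec[c] for c in range(2)) for r in range(2)]

J = jz_over_hbar()
right = [1 / math.sqrt(2), 1j / math.sqrt(2)]    # (1, i) / sqrt(2)
left = [1 / math.sqrt(2), -1j / math.sqrt(2)]    # (1, -i) / sqrt(2)
print(apply(J, right), right)
print(apply(J, left), left)
```

Up to terms of order $\phi$, the operator returns the first vector unchanged and the second with its sign reversed, corresponding to eigenvalues $+1$ and $-1$.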


4.8 The energy density of the electromagnetic field 


The analysis of the energy density of the electromagnetic field in free space is a 
generalisation of the analysis for a scalar field set out in Section 3.6. Equation (3.25) 
becomes 


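$$T^\mu_\nu = \frac{\partial\mathcal{L}}{\partial(\partial_\mu A_\lambda)}\,\partial_\nu A_\lambda - \delta^\mu_\nu\,\mathcal{L}, \tag{4.22}$$

and using this formula gives

$$T^0_0 = -F^{0\mu}F_{0\mu} + \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} \tag{4.23}$$

(Problem 4.5). In terms of the physical fields $\mathbf{E}$ and $\mathbf{B}$, (4.23) is the familiar expression

$$\text{energy density} = \tfrac{1}{2}(\mathbf{E}^2 + \mathbf{B}^2). \tag{4.24}$$

We can also express the fields in terms of the field amplitudes $a_{\mathbf{k}\alpha}$ introduced in
equation (4.15) and obtain for the total energy of the field

$$H = \int T^0_0\,d^3x = \sum_{\mathbf{k},\alpha} a_{\mathbf{k}\alpha}^*\,a_{\mathbf{k}\alpha}\,\omega_k. \tag{4.25}$$

Similarly the total momentum of the field is

$$\mathbf{P} = \sum_{\mathbf{k},\alpha} a_{\mathbf{k}\alpha}^*\,a_{\mathbf{k}\alpha}\,\mathbf{k}. \tag{4.26}$$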

4.9 Massive vector fields 

Let us modify the Lagrangian density (4.10) by adding an additional Lorentz invariant
term, and consider

$$\mathcal{L} = -\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} + \tfrac{1}{2}m^2 A_\mu A^\mu - J^\mu A_\mu, \tag{4.27}$$

where $J^\mu$ is an external current. The additional term in the action is easily seen to
modify the field equations to

$$\partial_\mu F^{\mu\nu} + m^2 A^\nu = J^\nu. \tag{4.28}$$

Since $\partial_\nu\partial_\mu F^{\mu\nu} = 0$, it follows from (4.28) that

$$m^2\,\partial_\nu A^\nu = \partial_\nu J^\nu. \tag{4.29}$$

This equation is a necessary consequence of the field equations: it is not a Lorentz
gauge-fixing condition like equation (4.16), but it does imply that the $A^\nu$ are not
independent. Using this equation, the field equations simplify to

$$\partial_\mu\partial^\mu A^\nu + m^2 A^\nu = J^\nu + \partial^\nu(\partial_\mu J^\mu)/m^2. \tag{4.30}$$

Hence in free space each component $A^\nu$ of the field satisfies

$$\frac{\partial^2 A^\nu}{\partial t^2} - \nabla^2 A^\nu + m^2 A^\nu = 0. \tag{4.31}$$

This wave equation is related by the quantisation rules $E \to i\,\partial/\partial t$, $\mathbf{p} \to -i\nabla$, to
the Einstein equation for a free particle,

$$E^2 = \mathbf{p}^2 + m^2.$$

We may conclude that our modified Lagrangian, when quantised, describes particles
of mass $m$ associated with a four-component field, of which three components are
independent.

Plane wave solutions of (4.31) are of the form

$$A^\nu = a\,\varepsilon^\nu\cos(\mathbf{k}\cdot\mathbf{r} - \omega_k t) = a\,\varepsilon^\nu\cos(k_\mu x^\mu),$$

where $\omega_k = k^0 = \sqrt{m^2 + \mathbf{k}^2}$. To satisfy the condition $\partial_\nu A^\nu = 0$ we need

$$k_\nu\varepsilon^\nu = 0. \tag{4.32}$$

For example, if we consider a plane wave in the $z$-direction with $k^\nu = (k^0, 0, 0, k)$
there are three independent polarisations, labelled 1, 2, 3, which we may take as
the contravariant four-vectors

$$\varepsilon_1^\nu = (0, 1, 0, 0),$$
$$\varepsilon_2^\nu = (0, 0, 1, 0),$$
$$\varepsilon_3^\nu = (k, 0, 0, k^0)/m.$$

The intrinsic spin of a particle is its angular momentum in a frame of reference
in which it is at rest (Section 4.7). In such a frame $\mathbf{k} = 0$, and $\varepsilon_1 = (0, \boldsymbol{\varepsilon}_x)$, $\varepsilon_2 =
(0, \boldsymbol{\varepsilon}_y)$, $\varepsilon_3 = (0, \boldsymbol{\varepsilon}_z)$. As in Section 4.7, the states with polarisation $\boldsymbol{\varepsilon}_x \pm i\boldsymbol{\varepsilon}_y$
correspond to $J_z = \pm 1$, but we now have also the state with polarisation $\boldsymbol{\varepsilon}_z$, which
corresponds to $J_z = 0$, since the operator $r_z$ acting on $\boldsymbol{\varepsilon}_z$ gives $r_z\boldsymbol{\varepsilon}_z = 0$.

Thus our modified Lagrangian describes massive particles having intrinsic spin
$S$ with $S = 1$ and $S_z = 1, 0, -1$. That such particles are important in the Standard
Model will become evident in later chapters.
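
The condition (4.32) and the three polarisation four-vectors quoted above can be checked with a few lines of arithmetic. The sketch assumes the metric signature $(+, -, -, -)$ for the four-vector product; the numerical values of `k` and `m` and the function names are arbitrary illustrations.

```python
import math

def minkowski_dot(a, b):
    """Four-vector product a_mu b^mu with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def polarisations(k, m):
    """Wave four-vector and the three polarisation four-vectors for motion along z."""
    k0 = math.sqrt(m * m + k * k)
    k_vec = (k0, 0.0, 0.0, k)
    eps = [
        (0.0, 1.0, 0.0, 0.0),
        (0.0, 0.0, 1.0, 0.0),
        (k / m, 0.0, 0.0, k0 / m),
    ]
    return k_vec, eps

k_vec, eps = polarisations(k=2.5, m=0.9)
for e in eps:
    # Expect k.eps = 0 and eps.eps = -1 for each polarisation.
    print(minkowski_dot(k_vec, e), minkowski_dot(e, e))
```

In the rest frame ($k = 0$) the third vector reduces to $(0, 0, 0, 1)$, the polarisation along $z$ discussed in the text.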


Problems 

4.1 Show that the Lagrangian density of equation (4.10) can also be written

$$\mathcal{L} = \tfrac{1}{2}(\mathbf{E}^2 - \mathbf{B}^2) - \rho\phi + \mathbf{J}\cdot\mathbf{A}.$$

4.2 Suppose that in a certain gauge $\nabla\cdot\mathbf{A} = f(\mathbf{r}, t) \neq 0$. Find an expression for a gauge
transforming function $\chi(\mathbf{r}, t)$ such that the new potentials given by equation (4.12)
satisfy the radiation gauge condition.

4.3 Show that the tensor field $\tilde{F}^{\mu\nu} = \tfrac{1}{2}\varepsilon^{\mu\nu\alpha\beta}F_{\alpha\beta}$ has the same form as $F^{\mu\nu}$ but with the
electric and magnetic fields interchanged. Show that

\f^ v = E-B 

and that it is a scalar field under Lorentz transformations but a pseudoscalar under the
parity operation.

4.4 Show that the electric field of the wave of equation (4.21) with $a_1 = 1$, $a_2 = i$, is

$$(E_x, E_y, E_z) = -\sqrt{\frac{2\omega}{V}}\,[\sin(kz - \omega t),\ \cos(kz - \omega t),\ 0].$$

Show that as a function of time, at a fixed $z$, $\mathbf{E}$ rotates in a positive sense about the
$z$-axis. This is the definition of right circular polarisation.

4.5 Show that equation (4.22) gives immediately

$$T^0_0 = -F^{0\mu}\,\partial_0 A_\mu + \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu}.$$

Show that the term $\partial_\mu(A_0 F^{0\mu}) = \partial_i(A_0 F^{0i})$ can be added to this without changing
the total energy. Hence arrive at the form for $T^0_0$ given in equation (4.23).

4.6 A particle of mass $m$, charge $q$, is moving in a fixed external electromagnetic field
described by the four-potential $(\phi, \mathbf{A})$. Show that the Lagrangian

$$L = \tfrac{1}{2}m\dot{\mathbf{x}}^2 - q\phi + q\,\dot{\mathbf{x}}\cdot\mathbf{A}$$

gives the non-relativistic equation of motion

$$m\ddot{\mathbf{x}} = q(\mathbf{E} + \dot{\mathbf{x}}\times\mathbf{B}),$$

and the Hamiltonian is

$$H = \frac{(\mathbf{p} - q\mathbf{A})^2}{2m} + q\phi,$$

where $\mathbf{p} = m\dot{\mathbf{x}} + q\mathbf{A}$.

4.7 Show that for a particle the action $S = \int L\,dt$ is Lorentz invariant if $\gamma L$ is Lorentz
invariant. Verify that this condition is satisfied by the Lagrangian

$$L = -m/\gamma - qA_\mu(dx^\mu/dt).$$

(This gives the relativistic version of Problem 4.6.)


The Dirac equation and the Dirac field


The Standard Model is a quantum field theory. In Chapter 4 we discussed the 
classical electromagnetic field. The transition to a quantum field will be made in 
Chapter 8. In this chapter we begin our discussion of the Dirac equation, which was
invented by Dirac as an equation for the relativistic quantum wave function of a 
single electron. However, we shall regard the Dirac wave function as a field, which 
will subsequently be quantised along with the electromagnetic field. The Dirac 
equation will be regarded as a field equation. The transition to a quantum field theory 
is called second quantisation. The field, like the Dirac wave function, is complex. 
We shall show how the Dirac field transforms under a Lorentz transformation, and 
find a Lorentz invariant Lagrangian from which it may be derived. 

On quantisation, the electromagnetic fields $A_\mu(x)$, $F_{\mu\nu}(x)$ become space- and
time-dependent operators. The expectation values of these operators in the environment
described by the quantum states are the classical fields. The Dirac fields $\psi(x)$
also become space- and time-dependent operators on quantisation. However, there
are no corresponding measurable classical fields. This difference reflects the Pauli 
exclusion principle, which applies to fermions but not to bosons. In this chapter 
and in the following two chapters, the properties of the Dirac fields as operators are 
rarely invoked: for the most part the manipulations proceed as if the Dirac fields 
were ordinary complex functions, and the fields can be thought of as single-particle 
Dirac wave functions. 


5.1 The Dirac equation 


Dirac invented his equation in seeking to make Schrödinger’s equation for an electron
compatible with special relativity. The Schrödinger equation for an electron
wave function $\psi$ is

$$i\frac{\partial\psi}{\partial t} = H\psi.$$

To secure a symmetry between space and time, Dirac postulated the Hamiltonian
for a free electron to be of the form

$$H_D = \boldsymbol{\alpha}\cdot\mathbf{p} + \beta m = -i\boldsymbol{\alpha}\cdot\nabla + \beta m, \tag{5.1}$$

where $m$ is the mass of the electron, $\mathbf{p}$ its momentum, $\boldsymbol{\alpha} = (\alpha_1, \alpha_2, \alpha_3)$, and
$\alpha_1$, $\alpha_2$, $\alpha_3$ and $\beta$ are matrices. $\psi$ is a column vector, and the Schrödinger equation
becomes the multicomponent Dirac equation:

$$(i\,\partial/\partial t + i\boldsymbol{\alpha}\cdot\nabla - \beta m)\psi = 0. \tag{5.2}$$


If this equation is to describe a free electron of mass m, its solutions should also 
satisfy the Klein-Gordon equation of Section 3.5. Multiplying the Dirac equation 
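on the left by the operator $(i\,\partial/\partial t - i\boldsymbol{\alpha}\cdot\nabla + \beta m)$, we obtain

$$\left[-\frac{\partial^2}{\partial t^2} + \sum_i \alpha_i^2\,\partial_i^2 + \sum_{i<j}(\alpha_i\alpha_j + \alpha_j\alpha_i)\,\partial_i\partial_j + im\sum_i(\alpha_i\beta + \beta\alpha_i)\,\partial_i - \beta^2 m^2\right]\psi = 0,$$

where $\partial_i = \partial/\partial x^i$. This equation is identical to the Klein-Gordon equation if

$$\beta^2 = 1, \quad \alpha_1^2 = \alpha_2^2 = \alpha_3^2 = 1,$$
$$\alpha_i\alpha_j + \alpha_j\alpha_i = 0,\ i \neq j; \quad \alpha_i\beta + \beta\alpha_i = 0,\ i = 1, 2, 3. \tag{5.3}$$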


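The reader may recall that similar equations are satisfied by the set of $2 \times 2$ Pauli
spin matrices $\boldsymbol{\sigma} = (\sigma^1, \sigma^2, \sigma^3)$, where it is conventional to take

$$\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{5.4}$$

We shall also find it useful to write

$$\sigma^0 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$$

for the $2 \times 2$ unit matrix.

However, here we have four anticommuting matrices, the $\alpha_i$ and $\beta$, to represent.
It proves necessary to introduce a second set of Pauli matrices and represent the
$\alpha_i$ and $\beta$ by $4 \times 4$ matrices. The representation is not unique: different choices are
appropriate for illuminating different properties of the Dirac equation. We shall use
the so-called chiral representation, in which

$$\alpha^i = \begin{pmatrix} -\sigma^i & 0 \\ 0 & \sigma^i \end{pmatrix}, \quad \beta = \begin{pmatrix} 0 & \sigma^0 \\ \sigma^0 & 0 \end{pmatrix}, \tag{5.5}$$

writing the matrices in $2 \times 2$ ‘block’ form. Here

$$0 = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}$$

and the $4 \times 4$ identity matrix may be written

$$I = \begin{pmatrix} \sigma^0 & 0 \\ 0 & \sigma^0 \end{pmatrix}.$$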

It can easily be checked that these matrices satisfy the conditions (5.3). (The block 
multiplication of matrices is described in Appendix A.) 
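
The conditions (5.3) can be verified mechanically. The sketch below assembles the chiral-representation matrices from $2 \times 2$ blocks, taking $\alpha^i = \mathrm{diag}(-\sigma^i, \sigma^i)$ and $\beta$ with $\sigma^0$ in the off-diagonal blocks (the block structure that appears in equation (5.9)), and uses plain nested lists; the helper names are illustrative.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def neg(a):
    return [[-x for x in row] for row in a]

def block(tl, tr, bl, br):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    top = [tl[i] + tr[i] for i in range(2)]
    bottom = [bl[i] + br[i] for i in range(2)]
    return top + bottom

def anticommutator(a, b):
    return add(matmul(a, b), matmul(b, a))

ZERO2 = [[0, 0], [0, 0]]
SIGMA0 = [[1, 0], [0, 1]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

ALPHA = [block(neg(s), ZERO2, ZERO2, s) for s in SIGMA]
BETA = block(ZERO2, SIGMA0, SIGMA0, ZERO2)
IDENTITY4 = block(SIGMA0, ZERO2, ZERO2, SIGMA0)
ZERO4 = [[0] * 4 for _ in range(4)]

print(matmul(BETA, BETA) == IDENTITY4)
print(all(matmul(a, a) == IDENTITY4 for a in ALPHA))
print(all(anticommutator(a, BETA) == ZERO4 for a in ALPHA))
```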

Since the $\alpha_i$ and $\beta$ are $4 \times 4$ matrices, the Dirac wave function $\psi$ is a
four-component column matrix. Regarded as a relativistic Schrödinger equation, the
Dirac equation has, as we shall see, remarkable consequences: it describes a particle
with intrinsic angular momentum $(\hbar/2)\boldsymbol{\sigma}$ and intrinsic magnetic moment
$(q\hbar/2m)\boldsymbol{\sigma}$ if the particle carries charge $q$, and there exist ‘negative energy’ solutions,
which Dirac interpreted as antiparticles.

A Lagrangian density that yields the Dirac equation from the action principle is

$$\begin{aligned}
\mathcal{L} &= \psi^\dagger(i\,\partial/\partial t + i\boldsymbol{\alpha}\cdot\nabla - \beta m)\psi \\
&= \psi_a^*(I_{ab}\,i\,\partial/\partial t + i\boldsymbol{\alpha}_{ab}\cdot\nabla - \beta_{ab}m)\psi_b,
\end{aligned} \tag{5.6}$$

where we have written in the matrix indices. $\psi^\dagger$ is a row matrix, the Hermitian
conjugate $\psi^\dagger = \psi^{T*}$ of $\psi$. Instead of varying the real and imaginary parts of $\psi_a$
independently, it is formally equivalent to treat $\psi_a$ and its complex conjugate $\psi_a^*$ as
independent fields (cf. Section 3.7). The condition that $S = \int\mathcal{L}\,d^4x$ be stationary
for an arbitrary variation $\delta\psi_a^*$ then gives the Dirac equation immediately, since $\mathcal{L}$
does not depend on the derivatives of $\psi_a^*$.


5.2 Lorentz transformations and Lorentz invariance 


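The chiral representation (5.5) of the matrices $\alpha^i$ and $\beta$ is particularly convenient
for discussing the way in which the Dirac field must transform under a Lorentz
transformation. We have written the Dirac matrices in blocks of $2 \times 2$ matrices,
and it is natural to write similarly the four-component Dirac field as a pair of
two-component fields

$$\psi = \begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix}, \tag{5.7}$$

where $\psi_L$ and $\psi_R$ are, respectively, the top and bottom two components of the
four-component Dirac field:

$$\psi_L = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}, \quad \psi_R = \begin{pmatrix} \psi_3 \\ \psi_4 \end{pmatrix}. \tag{5.8}$$

The Dirac equation (5.2) becomes

$$i\begin{pmatrix} \sigma^0 & 0 \\ 0 & \sigma^0 \end{pmatrix}\begin{pmatrix} \partial_0\psi_L \\ \partial_0\psi_R \end{pmatrix} + i\begin{pmatrix} -\sigma^i & 0 \\ 0 & \sigma^i \end{pmatrix}\begin{pmatrix} \partial_i\psi_L \\ \partial_i\psi_R \end{pmatrix} - m\begin{pmatrix} 0 & \sigma^0 \\ \sigma^0 & 0 \end{pmatrix}\begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix} = 0. \tag{5.9}$$

Block multiplication then gives two coupled equations for $\psi_L$ and $\psi_R$:

$$\begin{aligned}
i\sigma^0\partial_0\psi_L - i\sigma^i\partial_i\psi_L - m\psi_R &= 0, \\
i\sigma^0\partial_0\psi_R + i\sigma^i\partial_i\psi_R - m\psi_L &= 0.
\end{aligned} \tag{5.10}$$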


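We shall find it highly convenient for displaying the Lorentz structure to define

$$\sigma^\mu = (\sigma^0, \sigma^1, \sigma^2, \sigma^3), \quad \tilde\sigma^\mu = (\sigma^0, -\sigma^1, -\sigma^2, -\sigma^3).$$

With this notation, the equations (5.10) may be written

$$\begin{aligned}
i\tilde\sigma^\mu\partial_\mu\psi_L - m\psi_R &= 0, \\
i\sigma^\mu\partial_\mu\psi_R - m\psi_L &= 0.
\end{aligned} \tag{5.11}$$

To obtain the Lagrangian density (5.6) in terms of $\psi_L$ and $\psi_R$, we need to
multiply the expression on the left-hand side of (5.9) by the row matrix $(\psi_L^\dagger, \psi_R^\dagger)$,
where the Hermitian conjugate fields are $\psi_L^\dagger = (\psi_1^*, \psi_2^*)$, $\psi_R^\dagger = (\psi_3^*, \psi_4^*)$. Block
multiplication gives

$$\mathcal{L} = i\psi_L^\dagger\tilde\sigma^\mu\partial_\mu\psi_L + i\psi_R^\dagger\sigma^\mu\partial_\mu\psi_R - m(\psi_L^\dagger\psi_R + \psi_R^\dagger\psi_L). \tag{5.12}$$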


Variations Sxfi £ and <hj/^ in the action give the field equations (5.11). 

To show that the Lagrangian has the same form in every frame of reference, 
we must relate the field fi'(x') in the frame K' to fi(x) in the frame K, when x' 
and v refer to the same point in space-time, and are related by a proper Lorentz 
transformation 


x ,,L = ZA,;t v . (5.13) 

The operator 3 ;/ transforms like a covariant vector, so that 

K = V9v 

which has the inverse 

9/x = L\& v . (5.14) 

(See Problem 2.2.)

It is shown in Appendix B (equations (B.17) and (B.18)) that with this Lorentz
transformation we can associate $2 \times 2$ matrices $M$ and $N$ with determinant 1 and
with the properties

$$M^\dagger \tilde{\sigma}^\nu M = L^\nu{}_\mu \tilde{\sigma}^\mu, \tag{5.15}$$

$$N^\dagger \sigma^\nu N = L^\nu{}_\mu \sigma^\mu. \tag{5.16}$$

The matrices $M$ and $N$ are related by (B.19):

$$M^\dagger N = N^\dagger M = I. \tag{5.17}$$


In the frame K' the Lagrangian density (5.12) can be written 

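$$\mathcal{L} = i\psi_L^\dagger M^\dagger \tilde{\sigma}^\nu M \partial'_\nu \psi_L + i\psi_R^\dagger N^\dagger \sigma^\nu N \partial'_\nu \psi_R - m(\psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L), \tag{5.18}$$

where we have used (5.14) along with (5.15) and (5.16) in the first two terms.

We must define

$$\psi'_L(x') = M\psi_L(x), \tag{5.19}$$

$$\psi'_R(x') = N\psi_R(x), \tag{5.20}$$

to give

$$\mathcal{L} = i\psi'^\dagger_L \tilde{\sigma}^\nu \partial'_\nu \psi'_L + i\psi'^\dagger_R \sigma^\nu \partial'_\nu \psi'_R - m(\psi'^\dagger_L \psi'_R + \psi'^\dagger_R \psi'_L)$$

(noting that $\psi'^\dagger_L \psi'_R = \psi_L^\dagger M^\dagger N \psi_R = \psi_L^\dagger \psi_R$, since $M^\dagger N = I$, and similarly $\psi'^\dagger_R \psi'_L = \psi_R^\dagger \psi_L$).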

With the transformations (5.19) and (5.20) the Lagrangian, and hence the field 
equations, take the same form in every inertial frame. The way to construct an M 
and an N for any Lorentz transformation is given in Appendix B. 

An example of a rotation is 


$$L^\mu{}_\nu = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\theta & \sin\theta & 0 \\ 0 & -\sin\theta & \cos\theta & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}. \tag{5.21}$$


This is a rotation of the coordinate axes through an angle $\theta$ about the z-axis and is
equivalent to equations (2.1). The corresponding matrix $M$ is unitary:

$$M = \begin{pmatrix} e^{i\theta/2} & 0 \\ 0 & e^{-i\theta/2} \end{pmatrix}. \tag{5.22}$$

Hence, from (5.17), $N = (M^\dagger)^{-1} = M$, since $MM^\dagger = I$. The reader may verify that
(5.15) and (5.16) hold. $M$ is unitary (and hence equal to $N$) for all rotations.

An example of a Lorentz boost is





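$$L^\mu{}_\nu = \begin{pmatrix} \cosh\theta & 0 & 0 & -\sinh\theta \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ -\sinh\theta & 0 & 0 & \cosh\theta \end{pmatrix}. \tag{5.23}$$

This is a boost with velocity $v/c = \tanh\theta$ along the z-axis and is equivalent to
equations (2.3). The corresponding matrix $M$ is

$$M = \begin{pmatrix} e^{\theta/2} & 0 \\ 0 & e^{-\theta/2} \end{pmatrix} \quad \text{and} \quad N = (M^\dagger)^{-1} = \begin{pmatrix} e^{-\theta/2} & 0 \\ 0 & e^{\theta/2} \end{pmatrix} = M^{-1}. \tag{5.24}$$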
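
As a numerical cross-check of the rotation and boost examples, the following Python sketch (using NumPy) builds $L$, $M$ and $N$ for both cases and tests the properties (5.15), (5.16) and (5.17). The function names are ours and purely illustrative; the sketch assumes that row $\nu$, column $\mu$ of the $4 \times 4$ array holds $L^\nu{}_\mu$.

```python
import numpy as np

# Pauli matrices: sigma^mu = (s0, s1, s2, s3), sigma-tilde^mu = (s0, -s1, -s2, -s3)
s0 = np.eye(2, dtype=complex)
s1 = np.array([[0, 1], [1, 0]], dtype=complex)
s2 = np.array([[0, -1j], [1j, 0]], dtype=complex)
s3 = np.array([[1, 0], [0, -1]], dtype=complex)
sigma = [s0, s1, s2, s3]
sigma_t = [s0, -s1, -s2, -s3]


def rotation(theta):
    """Rotation matrix L of (5.21) together with M = N of (5.22)."""
    c, s = np.cos(theta), np.sin(theta)
    L = np.array([[1, 0, 0, 0], [0, c, s, 0], [0, -s, c, 0], [0, 0, 0, 1]])
    M = np.diag([np.exp(1j * theta / 2), np.exp(-1j * theta / 2)])
    return L, M, M


def boost(theta):
    """Boost matrix L of (5.23) together with M and N = M^{-1} of (5.24)."""
    ch, sh = np.cosh(theta), np.sinh(theta)
    L = np.array([[ch, 0, 0, -sh], [0, 1, 0, 0], [0, 0, 1, 0], [-sh, 0, 0, ch]])
    M = np.diag([np.exp(theta / 2), np.exp(-theta / 2)]).astype(complex)
    return L, M, np.linalg.inv(M)


def satisfies_5_15_to_5_17(L, M, N):
    """True if M and N reproduce L through (5.15), (5.16) and obey (5.17)."""
    ok = np.allclose(M.conj().T @ N, np.eye(2))
    for nu in range(4):
        rhs_tilde = sum(L[nu, mu] * sigma_t[mu] for mu in range(4))
        rhs = sum(L[nu, mu] * sigma[mu] for mu in range(4))
        ok = ok and np.allclose(M.conj().T @ sigma_t[nu] @ M, rhs_tilde)
        ok = ok and np.allclose(N.conj().T @ sigma[nu] @ N, rhs)
    return bool(ok)
```

Calling `satisfies_5_15_to_5_17(*rotation(0.7))` or `satisfies_5_15_to_5_17(*boost(1.3))` exercises the two examples; interchanging $M$ and $N$ for the boost makes the check fail, which illustrates that left-handed and right-handed spinors transform differently under boosts.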

5.3 The parity transformation 

The Lagrangian density (5.12) can also be made invariant under space inversion 
of the axes. Denoting by a prime the space coordinates of a point as seen from the 
inverted axes, we have 

r' = -r and V' = -V. (5.25) 

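Hence, from the definitions of $\sigma^\mu$ and $\tilde{\sigma}^\mu$ (given after (5.10)),

$$\tilde{\sigma}^\mu \partial'_\mu = \sigma^\mu \partial_\mu, \qquad \sigma^\mu \partial'_\mu = \tilde{\sigma}^\mu \partial_\mu. \tag{5.26}$$

Our Lagrangian density (5.12) is evidently invariant if $\psi \to \psi^P(\mathbf{r}')$ where

$$\psi_L^P(\mathbf{r}') = \psi_R(\mathbf{r}), \qquad \psi_R^P(\mathbf{r}') = \psi_L(\mathbf{r}). \tag{5.27}$$

Actually the Lagrangian density would also retain the same form if we were to
take, for example,

$$\psi_L^P(\mathbf{r}') = e^{i\alpha}\psi_R(\mathbf{r}), \qquad \psi_R^P(\mathbf{r}') = e^{i\alpha}\psi_L(\mathbf{r}),$$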

for any real a. It is the standard convention to adopt the form (5.27) for the field 
transformation under space inversion. 


5.4 Spinors 

Two-component complex quantities that transform under a Lorentz transformation
according to the rules (5.19) and (5.20) are called left-handed spinors and
right-handed spinors, respectively. Our subscripts L and R anticipated this. The
four-component Dirac field is often called a Dirac spinor.

Spinors have the remarkable property that they can be combined in pairs
to make Lorentz scalars, pseudoscalars, four-vectors, pseudovectors and higher
order tensors. For example, $(\psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L)$ is a Lorentz invariant real scalar
and $i(\psi_L^\dagger \psi_R - \psi_R^\dagger \psi_L)$ is a real pseudoscalar; it is invariant under proper Lorentz
transformations but changes sign under space inversion. Using (5.15), (5.16) and
(5.27), we can see that $(\psi_L^\dagger \tilde{\sigma}^\mu \psi_L + \psi_R^\dagger \sigma^\mu \psi_R)$ is a four-vector, the space-like
components of which change sign under space inversion (since $\tilde{\sigma}^i = -\sigma^i$), and
$(\psi_L^\dagger \tilde{\sigma}^\mu \psi_L - \psi_R^\dagger \sigma^\mu \psi_R)$ is an axial four-vector, the space-like components of which
are unchanged under space inversion.


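5.5 The matrices $\gamma^\mu$

The separation of the Dirac spinor into left-handed and right-handed components
will be particularly appropriate when we discuss the weak interaction. For describing
the electromagnetic interactions of fermions it is convenient to introduce $4 \times 4$
matrices $\gamma^\mu$ defined by

$$\gamma^0 = \beta, \qquad \gamma^i = \beta\alpha^i, \quad i = 1, 2, 3. \tag{5.28}$$

It follows from the properties of the $\beta$ and $\alpha^i$ matrices that

$$(\gamma^0)^2 = I; \qquad (\gamma^i)^2 = -I, \quad i = 1, 2, 3; \qquad \gamma^\mu \gamma^\nu + \gamma^\nu \gamma^\mu = 0, \quad \mu \neq \nu. \tag{5.29}$$

In the chiral representation,

$$\gamma^0 = \begin{pmatrix} 0 & \sigma^0 \\ \sigma^0 & 0 \end{pmatrix}, \qquad \gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}. \tag{5.30}$$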


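Written with the $\gamma^\mu$ matrices, the Lagrangian density (5.6) becomes

$$\mathcal{L} = \bar{\psi}(i\gamma^\mu \partial_\mu - m)\psi, \tag{5.31}$$

where $\bar{\psi}$ is the row matrix $\bar{\psi} = \psi^\dagger \gamma^0$, and the Dirac equation takes the symmetrical
form

$$(i\gamma^\mu \partial_\mu - m)\psi = 0. \tag{5.32}$$

Another useful matrix is $\gamma^5 = i\gamma^0 \gamma^1 \gamma^2 \gamma^3$. In the chiral representation,

$$\gamma^5 = \begin{pmatrix} -\sigma^0 & 0 \\ 0 & \sigma^0 \end{pmatrix}. \tag{5.33}$$

The matrices $\tfrac{1}{2}(I - \gamma^5)$, $\tfrac{1}{2}(I + \gamma^5)$ are projection operators giving the left-handed
and right-handed parts of a Dirac spinor:

$$\tfrac{1}{2}(I - \gamma^5)\psi = \begin{pmatrix} \sigma^0 & 0 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix} = \begin{pmatrix} \psi_L \\ 0 \end{pmatrix}, \qquad \tfrac{1}{2}(I + \gamma^5)\psi = \begin{pmatrix} 0 \\ \psi_R \end{pmatrix}. \tag{5.34}$$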


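It is straightforward to verify that the Lorentz scalars and vectors constructed in
Section 5.4 from two-component spinors can be written:

$$\begin{aligned} \psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L &= \bar{\psi}\psi && \text{(scalar)} \\ i(\psi_L^\dagger \psi_R - \psi_R^\dagger \psi_L) &= i\bar{\psi}\gamma^5 \psi && \text{(pseudoscalar)} \\ \psi_L^\dagger \tilde{\sigma}^\mu \psi_L + \psi_R^\dagger \sigma^\mu \psi_R &= \bar{\psi}\gamma^\mu \psi && \text{(contravariant four-vector)} \\ \psi_L^\dagger \tilde{\sigma}^\mu \psi_L - \psi_R^\dagger \sigma^\mu \psi_R &= \bar{\psi}\gamma^5 \gamma^\mu \psi && \text{(contravariant axial vector).} \end{aligned}$$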

Note that these quantities are all real. 
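
The identities of this section lend themselves to a quick numerical check. The sketch below (Python with NumPy; all names are illustrative) builds the $\gamma^\mu$ from $\gamma^0 = \beta$, $\gamma^i = \beta\alpha^i$, assuming the chiral forms $\alpha^i = \mathrm{diag}(-\sigma^i, \sigma^i)$ and $\beta$ with $\sigma^0$ in the off-diagonal blocks that are implied by (5.9). It then tests the anticommutation relations and compares the four bilinears computed with $\gamma$ matrices against their two-component forms.

```python
import numpy as np

s0 = np.eye(2, dtype=complex)
s = [np.array([[0, 1], [1, 0]], dtype=complex),
     np.array([[0, -1j], [1j, 0]], dtype=complex),
     np.array([[1, 0], [0, -1]], dtype=complex)]
Z = np.zeros((2, 2), dtype=complex)

# Chiral representation (assumed form of alpha^i and beta, see lead-in)
beta = np.block([[Z, s0], [s0, Z]])
alpha = [np.block([[-si, Z], [Z, si]]) for si in s]
gamma = [beta] + [beta @ a for a in alpha]            # gamma^0 .. gamma^3
gamma5 = 1j * gamma[0] @ gamma[1] @ gamma[2] @ gamma[3]


def anticommutators_ok():
    """Check gamma^mu gamma^nu + gamma^nu gamma^mu = 2 eta^{mu nu} I."""
    eta = np.diag([1.0, -1.0, -1.0, -1.0])
    return all(
        np.allclose(gamma[m] @ gamma[n] + gamma[n] @ gamma[m],
                    2 * eta[m, n] * np.eye(4))
        for m in range(4) for n in range(4))


def bilinears_from_gammas(psi):
    """Scalar, pseudoscalar, vector and axial vector built with gamma matrices."""
    bar = psi.conj() @ gamma[0]                        # psi-bar = psi^dagger gamma^0
    vec = [bar @ g @ psi for g in gamma]
    axial = [bar @ gamma5 @ g @ psi for g in gamma]
    return np.array([bar @ psi, 1j * bar @ gamma5 @ psi, *vec, *axial])


def bilinears_from_two_components(psi):
    """The same ten numbers built from psi_L and psi_R as in Section 5.4."""
    L, R = psi[:2], psi[2:]
    sig = [s0] + s
    sig_t = [s0] + [-si for si in s]
    vec = [np.vdot(L, sig_t[m] @ L) + np.vdot(R, sig[m] @ R) for m in range(4)]
    axial = [np.vdot(L, sig_t[m] @ L) - np.vdot(R, sig[m] @ R) for m in range(4)]
    scalar = np.vdot(L, R) + np.vdot(R, L)
    pseudo = 1j * (np.vdot(L, R) - np.vdot(R, L))
    return np.array([scalar, pseudo, *vec, *axial])
```

For any complex four-component `psi` the two functions should agree to rounding error, and the imaginary parts of all ten numbers should vanish, in line with the remark that these quantities are real.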


5.6 Making the Lagrangian density real 

A potential problem with our Lagrangian density (5.6) or (5.12) is that it is not 
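real. Regarding $\psi$ as a wave function, $\mathcal{L}$ is a complex function; regarding $\psi$ as an
operator, $\mathcal{L}$ is not Hermitian. As a consequence, the energy-momentum tensor is
complex. Indeed, to apply Hamilton's principle, the variation $\delta S$ in the action must
be real. The term $-m(\psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L)$ in (5.12) is real, and the imaginary part of
$\mathcal{L}$ may be written

$$\begin{aligned} &(1/2i)\left[ i\psi_L^\dagger \tilde{\sigma}^\mu \partial_\mu \psi_L + i\psi_R^\dagger \sigma^\mu \partial_\mu \psi_R - (i\psi_L^\dagger \tilde{\sigma}^\mu \partial_\mu \psi_L + i\psi_R^\dagger \sigma^\mu \partial_\mu \psi_R)^* \right] \\ &\quad = (1/2i)\left[ i\psi_L^\dagger \tilde{\sigma}^\mu \partial_\mu \psi_L + i\psi_R^\dagger \sigma^\mu \partial_\mu \psi_R + i(\partial_\mu \psi_L^\dagger)\tilde{\sigma}^\mu \psi_L + i(\partial_\mu \psi_R^\dagger)\sigma^\mu \psi_R \right] \end{aligned}$$

(where we have used the Hermitian property of the matrices $\sigma^\mu$ and $\tilde{\sigma}^\mu$). The last
expression is just

$$(1/2)\,\partial_\mu(\psi_L^\dagger \tilde{\sigma}^\mu \psi_L + \psi_R^\dagger \sigma^\mu \psi_R).$$

This is a sum of derivatives, which give only irrelevant end-point contributions
to the action (cf. Section 3.1). Hence $\delta S$ is real. The imaginary part of $\mathcal{L}$ can be
discarded, and we can take

$$\mathcal{L} = \tfrac{1}{2}\left[ (i\psi_L^\dagger \tilde{\sigma}^\mu \partial_\mu \psi_L + i\psi_R^\dagger \sigma^\mu \partial_\mu \psi_R) \right. \tag{5.35}$$

$$\left. +\ \text{Hermitian conjugate} \right] - m(\psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L). \tag{5.36}$$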

For further interesting discussion of this question see Olive (1997). 


Problems 

5.1 Show that the matrix M = N of equation (5.22) when inserted into equations (5.15) 
and (5.16) generates the rotation matrix (5.21). 

5.2 Show that the matrices $M$ and $N = M^{-1}$ given by equation (5.24) when inserted into
equations (5.15) and (5.16) generate the Lorentz boost of equation (5.23).

5.3 Show that $\psi_R^\dagger \psi_L$ and $\psi_L^\dagger \psi_R$ are invariant under proper Lorentz transformations.

Show that $\psi_L^\dagger \tilde{\sigma}^\mu \psi_L$ and $\psi_R^\dagger \sigma^\mu \psi_R$ are contravariant four-vectors under proper
Lorentz transformations.

Show that $\psi_R^\dagger \sigma^\mu \tilde{\sigma}^\nu \psi_L$ and $\psi_L^\dagger \tilde{\sigma}^\mu \sigma^\nu \psi_R$ are contravariant tensors under proper
Lorentz transformations.

5.4 Demonstrate the equivalence of the expressions (5.6) and (5.31) for the Lagrangian
density.

5.5 Show that $\gamma^5$ has the properties

$$(\gamma^5)^2 = I; \qquad \gamma^\mu \gamma^5 = -\gamma^5 \gamma^\mu; \quad \mu = 0, 1, 2, 3.$$

5.6 Show that $i\bar{\psi}\gamma^5 \psi$ is a pseudoscalar field and $\bar{\psi}\gamma^5 \gamma^\mu \psi = -\bar{\psi}\gamma^\mu \gamma^5 \psi$ is an axial
vector field.

5.7 Show that $(\gamma^0)^\dagger = \gamma^0$, $(\gamma^i)^\dagger = -\gamma^i$.

Free space solutions of the Dirac equation


In this chapter we display the plane wave solutions of the Dirac equation. We show 
that a Dirac particle has intrinsic spin $\hbar/2$, and we shall see how the Dirac equation
predicts the existence of antiparticles. 


6.1 A Dirac particle at rest 

In Chapter 5 we showed that the Dirac equation for a particle in free space is 
equivalent to the coupled two-component equations 

$$i\tilde{\sigma}^\mu \partial_\mu \psi_L - m\psi_R = 0, \qquad i\sigma^\mu \partial_\mu \psi_R - m\psi_L = 0. \tag{6.1}$$

These equations have plane wave solutions of the form

$$\psi_L = u_L e^{i(\mathbf{p}\cdot\mathbf{r} - Et)}, \qquad \psi_R = u_R e^{i(\mathbf{p}\cdot\mathbf{r} - Et)}, \tag{6.2}$$

where $u_L$ and $u_R$ are two-component spinors. Since solutions of the Dirac equation
also satisfy the Klein-Gordon equation (3.19), we must have

$$E^2 = \mathbf{p}^2 + m^2. \tag{6.3}$$

It is simplest to find the solution in a frame $K'$ in which the particle is at rest, and
then obtain the solution in a frame in which the particle is moving with velocity $\mathbf{v}$
by making a Lorentz boost. Using primes to denote quantities in the frame $K'$, the
momentum $\mathbf{p}' = 0$, so that equations (6.1) and (6.3) become

$$i\partial'_0 \psi'_L = m\psi'_R, \quad i\partial'_0 \psi'_R = m\psi'_L; \qquad E'^2 = m^2, \quad E' = \pm m. \tag{6.4}$$

The solutions with positive energy $E' = m$ are

$$\psi'_L = \psi'_R = u\, e^{-imt'}, \tag{6.5}$$

where

$$u = \begin{pmatrix} u_1 \\ u_2 \end{pmatrix}$$

is an arbitrary two-component spinor and we are adopting the standard convention
of quantum mechanics that the time dependence of an energy eigenstate is given
by the phase factor $e^{-iEt}$.

In the rest frame K', the left-handed and right-handed positive energy spinors 
are identical. As a consequence this solution is invariant under space inversion (see 
Section 5.3). It is said to have positive parity. 

6.2 The intrinsic spin of a Dirac particle 

The intrinsic spin operator S of a particle with mass is defined to be its angular 
momentum operator in a frame in which it is at rest. The component of S along the 
z-direction is given by 

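$$S_z = i\hbar \lim_{\phi \to 0} [R_z(\phi) - 1]/\phi,$$

where $R_z(\phi)$ is the operator that rotates the state of the particle through an angle $\phi$
about $Oz$ (cf. Section 4.7). A rotation of the state through an angle $\phi$ is equivalent to
rotating the axes through an angle $-\phi$, and then $\psi_L \to M\psi_L$, $\psi_R \to N\psi_R$ where,
from (5.22),

$$M = N = \begin{pmatrix} e^{-i\phi/2} & 0 \\ 0 & e^{i\phi/2} \end{pmatrix}.$$

Hence

$$S_z = i\hbar \lim_{\phi \to 0} \frac{1}{\phi} \begin{pmatrix} e^{-i\phi/2} - 1 & 0 \\ 0 & e^{i\phi/2} - 1 \end{pmatrix} = \frac{\hbar}{2} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} = \frac{\hbar}{2}\sigma^3.$$

In the state with $u_1 = 1$, $u_2 = 0$,

$$S_z \psi'_L = (\hbar/2)\psi'_L$$

and

$$S_z \psi'_R = (\hbar/2)\psi'_R.$$

Acting on the Dirac wave function, we have

$$S_z \begin{pmatrix} \psi'_L \\ \psi'_R \end{pmatrix} = (\hbar/2) \begin{pmatrix} \psi'_L \\ \psi'_R \end{pmatrix}. \tag{6.6}$$

Similarly, in the state with $u_1 = 0$, $u_2 = 1$,

$$S_z \begin{pmatrix} \psi'_L \\ \psi'_R \end{pmatrix} = -(\hbar/2) \begin{pmatrix} \psi'_L \\ \psi'_R \end{pmatrix}. \tag{6.7}$$

Thus in the rest frame of the particle there are two independent states which
are eigenstates of $S_z$ with eigenvalues $\pm(\hbar/2)$. The operator $S_z$ on a Dirac wave
function is represented by the matrix

$$\Sigma_z = (\hbar/2) \begin{pmatrix} \sigma^3 & 0 \\ 0 & \sigma^3 \end{pmatrix}. \tag{6.8}$$

More generally, $\mathbf{S}$ is represented by

$$\boldsymbol{\Sigma} = (\hbar/2) \begin{pmatrix} \boldsymbol{\sigma} & 0 \\ 0 & \boldsymbol{\sigma} \end{pmatrix}. \tag{6.9}$$

Also, every Dirac wave function is an eigenstate of the square of the spin operator,

$$\boldsymbol{\Sigma}^2 = (3/4)\hbar^2 I,$$

with eigenvalue $(3/4)\hbar^2 = (1/2)((1/2) + 1)\hbar^2$. Recalling that the square $\mathbf{J}^2$ of
the angular momentum for a state with angular momentum $j$ is $j(j + 1)\hbar^2$, it is
appropriate to say that a Dirac particle has intrinsic spin $\hbar/2$.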
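
The limit that defines $S_z$ and the value of $\boldsymbol{\Sigma}^2$ can both be checked numerically. The following Python sketch (NumPy, units with $\hbar = 1$, names chosen for illustration only) replaces the limit $\phi \to 0$ by a small finite angle and forms the sum of squares of the three $4 \times 4$ spin matrices of (6.9).

```python
import numpy as np

HBAR = 1.0  # assumption for this sketch: units with hbar = 1

s1 = np.array([[0, 1], [1, 0]], dtype=complex)
s2 = np.array([[0, -1j], [1j, 0]], dtype=complex)
s3 = np.array([[1, 0], [0, -1]], dtype=complex)


def spin_z_from_rotation(phi):
    """Finite-phi estimate of S_z = i hbar [R_z(phi) - 1] / phi on a two-component spinor."""
    # M = N for the axes rotated through -phi, from (5.22)
    R = np.diag([np.exp(-1j * phi / 2), np.exp(1j * phi / 2)])
    return 1j * HBAR * (R - np.eye(2)) / phi


def dirac_spin_matrices():
    """The three 4x4 matrices Sigma of (6.9)."""
    Z = np.zeros((2, 2), dtype=complex)
    return [0.5 * HBAR * np.block([[s, Z], [Z, s]]) for s in (s1, s2, s3)]


def spin_squared():
    """Sigma_x^2 + Sigma_y^2 + Sigma_z^2."""
    return sum(S @ S for S in dirac_spin_matrices())
```

For a small angle such as `phi = 1e-6`, `spin_z_from_rotation(phi)` should approach $(\hbar/2)\sigma^3$, and `spin_squared()` should equal $(3/4)\hbar^2$ times the $4 \times 4$ unit matrix.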


6.3 Plane waves and helicity 

We now transform to a frame K in which the frame K ', and the particle, are moving 
with velocity v. For simplicity we take v = (0, 0, v), along the z-axis with v > 0, 
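and consider the state with $u_1 = 1$, $u_2 = 0$.

Transformations between $K$ and $K'$ are then given by (5.23), along with (5.24).
Using (5.19) and (5.20),

$$\psi_L = M^{-1}\psi'_L = \begin{pmatrix} e^{-\theta/2} & 0 \\ 0 & e^{\theta/2} \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{-imt'}, \qquad \psi_R = N^{-1}\psi'_R = \begin{pmatrix} e^{\theta/2} & 0 \\ 0 & e^{-\theta/2} \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{-imt'}.$$

Finally, substituting $t' = t\cosh\theta - z\sinh\theta$ (and noting that $m\cosh\theta = \gamma m = E$,
$m\sinh\theta = \gamma m v = p$, where $\gamma = (1 - v^2/c^2)^{-1/2}$) we have

$$\psi_L = e^{-\theta/2} \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{i(pz - Et)}, \qquad \psi_R = e^{\theta/2} \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{i(pz - Et)}. \tag{6.10}$$

The helicity operator is useful in classifying plane wave states. It is defined by

$$\text{helicity} = \frac{\boldsymbol{\Sigma}\cdot\mathbf{p}}{|\mathbf{p}|}. \tag{6.11}$$

The expectation value of this operator in a given state is a measure of the alignment
of a particle's intrinsic spin with its direction of motion in that state. For $\mathbf{p} =
(0, 0, p)$, $p > 0$, the helicity operator $\boldsymbol{\Sigma}\cdot\mathbf{p}/|\mathbf{p}| = \Sigma_z$. Thus the state (6.10) is an
eigenstate of the helicity operator with positive helicity 1/2, which we can write as
a Dirac spinor

$$\psi_+ = \frac{1}{\sqrt{2}}\, e^{i(pz - Et)} \begin{pmatrix} e^{-\theta/2} \\ 0 \\ e^{\theta/2} \\ 0 \end{pmatrix}, \quad p > 0. \tag{6.12}$$

We have inserted the normalisation factor $1/\sqrt{2}$ to conform with the standard
normalisation of the Lorentz scalar $\bar{\psi}\psi$:

$$\bar{\psi}\psi = \psi^\dagger \gamma^0 \psi = \psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L = 1.$$

Similarly, taking $u_1 = 0$, $u_2 = 1$, we can construct an eigenstate of negative
helicity $-1/2$:

$$\psi_- = \frac{1}{\sqrt{2}}\, e^{i(pz - Et)} \begin{pmatrix} 0 \\ e^{\theta/2} \\ 0 \\ e^{-\theta/2} \end{pmatrix}, \quad p > 0. \tag{6.13}$$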
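
A short numerical sketch can confirm the three properties claimed for the boosted states: that they solve the Dirac equation, that $\bar{\psi}\psi = 1$, and that they carry helicity $\pm 1/2$. The Python code below (NumPy, units with $\hbar = c = 1$, illustrative names) uses the chiral-representation $\gamma^0$ and $\gamma^3$ and the fact that, acting on $e^{i(pz - Et)}$, the operator $i\gamma^\mu\partial_\mu$ becomes $\gamma^0 E - \gamma^3 p$.

```python
import numpy as np

s0 = np.eye(2, dtype=complex)
s3 = np.array([[1, 0], [0, -1]], dtype=complex)
Z = np.zeros((2, 2), dtype=complex)

# Chiral-representation gamma^0 and gamma^3, and Sigma_z of (6.8) with hbar = 1
gamma0 = np.block([[Z, s0], [s0, Z]])
gamma3 = np.block([[Z, s3], [-s3, Z]])
Sigma_z = 0.5 * np.block([[s3, Z], [Z, s3]])


def helicity_spinors(theta):
    """Spinor parts of (6.12) and (6.13) for rapidity theta, momentum along +z."""
    a, b = np.exp(-theta / 2), np.exp(theta / 2)
    u_plus = np.array([a, 0, b, 0], dtype=complex) / np.sqrt(2)
    u_minus = np.array([0, b, 0, a], dtype=complex) / np.sqrt(2)
    return u_plus, u_minus


def check(theta, m=1.0):
    """Return [ok_plus, ok_minus]: Dirac equation, normalisation and helicity."""
    E, p = m * np.cosh(theta), m * np.sinh(theta)
    dirac_op = gamma0 * E - gamma3 * p      # i gamma^mu d_mu on exp(i(pz - Et))
    results = []
    for u, h in zip(helicity_spinors(theta), (+0.5, -0.5)):
        ok = (np.allclose(dirac_op @ u, m * u)                    # Dirac equation
              and np.isclose(np.vdot(u, gamma0 @ u).real, 1.0)    # psi-bar psi = 1
              and np.allclose(Sigma_z @ u, h * u))                # helicity +-1/2
        results.append(bool(ok))
    return results
```

The check is independent of the rapidity chosen, since $E \mp p = m e^{\mp\theta}$ is exactly what converts the left-handed component into the right-handed one and vice versa.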


All plane waves with positive energy can be generated by applying rotations to the 
states we have found. The helicity of a state is unchanged by a rotation, since it is 
defined by a scalar product. The evident generalisations of (6.12) and (6.13) to a 
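wave with wave vector $\mathbf{p}$ are

$$\psi_+ = e^{i(\mathbf{p}\cdot\mathbf{r} - Et)} u_+(\mathbf{p}), \tag{6.14}$$

where

$$u_+(\mathbf{p}) = \frac{1}{\sqrt{2}} \begin{pmatrix} e^{-\theta/2}\,|+\rangle \\ e^{\theta/2}\,|+\rangle \end{pmatrix},$$

and

$$\psi_- = e^{i(\mathbf{p}\cdot\mathbf{r} - Et)} u_-(\mathbf{p}), \tag{6.15}$$

where

$$u_-(\mathbf{p}) = \frac{1}{\sqrt{2}} \begin{pmatrix} e^{\theta/2}\,|-\rangle \\ e^{-\theta/2}\,|-\rangle \end{pmatrix}.$$

The Pauli spin states $|\pm\rangle$ are here the eigenstates of the operators $\boldsymbol{\sigma}\cdot\mathbf{p}/|\mathbf{p}|$ with
eigenvalues $\pm 1$ (Problem 6.6). A general state of positive energy can be constructed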
as a superposition of plane waves. 


6.4 Negative energy solutions 


In the frame K' in which the particle is at rest, there are also negative energy 
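solutions of (6.4) with $E' = -m$:

$$\psi'_L = v\, e^{imt'}, \qquad \psi'_R = -v\, e^{imt'}. \tag{6.16}$$

In this case the left-handed and right-handed spinors $v$ differ in sign. Thus the
negative energy solution changes sign under space inversion (see Section 5.3). It is
said to have negative parity.

The same Lorentz boost we used above in Section 6.3 gives solutions $\psi_+$ and
$\psi_-$ with positive and negative helicity, respectively, which we can write as Dirac
spinors

$$\psi_+ = \frac{1}{\sqrt{2}}\, e^{i(-pz + Et)} \begin{pmatrix} 0 \\ e^{\theta/2} \\ 0 \\ -e^{-\theta/2} \end{pmatrix}, \qquad \psi_- = \frac{1}{\sqrt{2}}\, e^{i(-pz + Et)} \begin{pmatrix} -e^{-\theta/2} \\ 0 \\ e^{\theta/2} \\ 0 \end{pmatrix}. \tag{6.17}$$

These solutions generalise to

$$\psi_+ = e^{i(-\mathbf{p}\cdot\mathbf{r} + Et)} v_+(\mathbf{p}), \tag{6.18}$$

where

$$v_+(\mathbf{p}) = \frac{1}{\sqrt{2}} \begin{pmatrix} e^{\theta/2}\,|-\rangle \\ -e^{-\theta/2}\,|-\rangle \end{pmatrix},$$

and

$$\psi_- = e^{i(-\mathbf{p}\cdot\mathbf{r} + Et)} v_-(\mathbf{p}), \tag{6.19}$$

where

$$v_-(\mathbf{p}) = \frac{1}{\sqrt{2}} \begin{pmatrix} -e^{-\theta/2}\,|+\rangle \\ e^{\theta/2}\,|+\rangle \end{pmatrix}.$$

$|+\rangle$ and $|-\rangle$ remain eigenstates of $\boldsymbol{\sigma}\cdot\mathbf{p}/|\mathbf{p}|$ as defined below (6.15). Note that
the Lorentz invariant $\bar{\psi}\psi$ acquires a minus sign; in the case of the negative energy
solutions,

$$\bar{\psi}\psi = \psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L = -1.$$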


Negative energy solutions of the Dirac equation appear at first sight to be an 
embarrassment. In quantum theory a particle can make transitions between states. 
Hence all Dirac states would seem to be unstable to a transition to lower energy. 
Dirac’s solution to the difficulty was to assume that nearly all negative energy 
states are occupied, so that the Pauli exclusion principle forbids transitions to them. 
An unoccupied negative energy state, or hole, will behave as a positive energy 
antiparticle, of the same mass but opposite momentum, spin, and electric charge. 
Left unfilled, the negative energy state $\psi_+$ of (6.17) corresponds to an antiparticle
of positive energy $E$ and positive momentum $p$, and positive helicity, since the spin
of the hole is also opposite to that of the negative energy state.

A particle falling into an empty negative energy state will be seen as the simultaneous
annihilation of a particle-antiparticle pair with the emission of electromagnetic
energy $\geq 2mc^2$. Conversely, the excitation of a particle from a negative energy
state to a positive energy state will be seen as pair production. The existence of the
positron, the antiparticle of the electron, was established experimentally in 1932,
and the observation of pair production soon followed.

The uniform background sea of occupied negative energy states, with its associated
infinite electric charge, is assumed to be unobservable. In any case, it is
clearly quite arbitrary whether, say, the electron is regarded as the particle and the 
positron as antiparticle, or vice versa. Evidently our starting interpretation of the 
Dirac equation as a single particle equation is not tenable. We are led, inevitably, 
to a quantum field theory in which particles and antiparticles appear as the quanta 
of the field, in somewhat the same way as photons appear as the quanta of the 
electromagnetic field. We shall take up this theme in Chapter 8. 


6.5 The energy and momentum of the Dirac field 


The Lagrangian density of the Dirac field is given by (5.31), which we display in 
more detail: 


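$$\begin{aligned} \mathcal{L} &= \bar{\psi}(i\gamma^\mu \partial_\mu - m)\psi \\ &= i\psi_a^* \partial_0 \psi_a + \bar{\psi}_b (i\gamma^i_{ba} \partial_i - m\delta_{ba})\psi_a. \end{aligned} \tag{6.20}$$

As in Section 5.1 we may treat the fields $\psi_a$ and $\psi_a^*$ as independent, and take the
energy-momentum tensor to be

$$T^\mu{}_\nu = \frac{\partial\mathcal{L}}{\partial(\partial_\mu \psi_a)}\, \partial_\nu \psi_a - \mathcal{L}\,\delta^\mu{}_\nu \tag{6.21}$$

($\mathcal{L}$ does not depend on $\partial_\mu \psi_a^*$).

In particular, the energy density is

$$\begin{aligned} T^0{}_0 &= i\psi_a^* \partial_0 \psi_a - \mathcal{L} \\ &= \bar{\psi}(-i\gamma^i \partial_i + m)\psi \end{aligned} \tag{6.22}$$

and the momentum density is

$$T^0{}_i = i\psi_a^* \partial_i \psi_a = i\psi^\dagger \partial_i \psi. \tag{6.23}$$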

The general solution of the free space Dirac equation is a superposition of all 
possible plane waves, which we will write 

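$$\psi = \frac{1}{\sqrt{V}} \sum_{\mathbf{p},\varepsilon} \sqrt{\frac{m}{E_p}} \left( b_{\mathbf{p}\varepsilon}\, u_\varepsilon(\mathbf{p})\, e^{i(\mathbf{p}\cdot\mathbf{r} - E_p t)} + d^*_{\mathbf{p}\varepsilon}\, v_\varepsilon(\mathbf{p})\, e^{i(-\mathbf{p}\cdot\mathbf{r} + E_p t)} \right). \tag{6.24}$$

$\varepsilon$ is the helicity index, $\pm$, and $b_{\mathbf{p}\varepsilon}$ and $d_{\mathbf{p}\varepsilon}$ are arbitrary complex numbers. The
factors $\sqrt{m/E_p}$ take the place of the factors $1/\sqrt{2\omega_k}$ we inserted in the boson
field expansions of Chapter 3 and Chapter 4.

We can express the total energy and total momentum of the Dirac field in terms of
the wave amplitudes, by inserting the field expansion into $T^0{}_0$ and $T^0{}_i$, and integrating
over the normalisation volume $V$. The results are

$$H = \sum_{\mathbf{p},\varepsilon} (b^*_{\mathbf{p}\varepsilon} b_{\mathbf{p}\varepsilon} - d_{\mathbf{p}\varepsilon} d^*_{\mathbf{p}\varepsilon})\, E_p, \tag{6.25}$$

$$\mathbf{P} = \sum_{\mathbf{p},\varepsilon} (b^*_{\mathbf{p}\varepsilon} b_{\mathbf{p}\varepsilon} - d_{\mathbf{p}\varepsilon} d^*_{\mathbf{p}\varepsilon})\, \mathbf{p}. \tag{6.26}$$

$\varepsilon = \pm 1$ is the helicity index.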

The (somewhat tedious) derivation of these results is left to the reader. Note that 
each plane wave is a solution of the Dirac equation (5.32), which implies 

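$$\begin{aligned} (\gamma^0 E_p - \gamma^i p^i)\, u_\varepsilon(\mathbf{p}) &= m\, u_\varepsilon(\mathbf{p}), \\ (\gamma^0 E_p - \gamma^i p^i)\, v_\varepsilon(\mathbf{p}) &= -m\, v_\varepsilon(\mathbf{p}). \end{aligned} \tag{6.27}$$

It is also necessary to use various orthogonality relations, which are set out in
Problem 6.3.

For later convenience, we rewrite the Dirac field $\psi$ (6.24) in terms of $\psi_L$ and
$\psi_R$. Using (6.14), (6.15), (6.18) and (6.19) gives

$$\begin{aligned} \psi_L = \frac{1}{\sqrt{V}} \sum_{\mathbf{p}} \sqrt{\frac{m}{2E_p}} \Big[ &\left( b_{\mathbf{p}+} e^{-\theta/2}|+\rangle + b_{\mathbf{p}-} e^{\theta/2}|-\rangle \right) e^{i(\mathbf{p}\cdot\mathbf{r} - Et)} \\ &+ \left( d^*_{\mathbf{p}+} e^{\theta/2}|-\rangle - d^*_{\mathbf{p}-} e^{-\theta/2}|+\rangle \right) e^{i(-\mathbf{p}\cdot\mathbf{r} + Et)} \Big], \end{aligned} \tag{6.28}$$

$$\begin{aligned} \psi_R = \frac{1}{\sqrt{V}} \sum_{\mathbf{p}} \sqrt{\frac{m}{2E_p}} \Big[ &\left( b_{\mathbf{p}+} e^{\theta/2}|+\rangle + b_{\mathbf{p}-} e^{-\theta/2}|-\rangle \right) e^{i(\mathbf{p}\cdot\mathbf{r} - Et)} \\ &+ \left( -d^*_{\mathbf{p}+} e^{-\theta/2}|-\rangle + d^*_{\mathbf{p}-} e^{\theta/2}|+\rangle \right) e^{i(-\mathbf{p}\cdot\mathbf{r} + Et)} \Big]. \end{aligned} \tag{6.29}$$

6.6 Dirac and Majorana fields

The expansion (6.24) is the general solution of the free field Dirac equation. For
every momentum $\mathbf{p}$ there are four independent complex coefficients: $b_{\mathbf{p}+}$, $b_{\mathbf{p}-}$, $d^*_{\mathbf{p}+}$
and $d^*_{\mathbf{p}-}$, which correspond to particles with helicities $+1/2$, $-1/2$ and antiparticles
with helicities $+1/2$, $-1/2$, respectively.

It will be of interest, in Chapter 21, to consider solutions in which we impose the
constraint that $d_{\mathbf{p}+} = b_{\mathbf{p}+}$, $d_{\mathbf{p}-} = b_{\mathbf{p}-}$, and hence $d^*_{\mathbf{p}+} = b^*_{\mathbf{p}+}$, $d^*_{\mathbf{p}-} = b^*_{\mathbf{p}-}$. These
solutions are known as Majorana fields. On quantisation, we shall see that the
Dirac fields create and annihilate particles, and antiparticles. For example, if $\psi$ is
an electron field it creates positrons and annihilates electrons, $\psi^\dagger$ creates electrons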
and annihilates positrons. With the Majorana constraint, particles and antiparticles 
are identical. Majorana fields are irrelevant for electrically charged particles, but it 
is possible that the electrically neutral neutrino fields have this property. It is still 
an open question whether neutrino fields are Dirac or Majorana. 


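6.7 The $E \gg m$ limit, neutrinos

The coefficients of the plane waves in the expansions (6.28) and (6.29) may be
expressed as

$$\sqrt{m/2E}\; e^{\pm\theta/2} = \{(1 \pm v/c)/2\}^{1/2}, \tag{6.30}$$

where $v$ is the particle velocity (Problem 6.1). In the high energy limit, $E \gg m$, the
velocity $v \to c$. The only significant terms in the field expansions which survive in
this limit are

$$\psi_L = \frac{1}{\sqrt{V}} \sum_{\mathbf{p}} \left( b_{\mathbf{p}-}\, |-\rangle\, e^{i(\mathbf{p}\cdot\mathbf{r} - Et)} + d^*_{\mathbf{p}+}\, |-\rangle\, e^{i(-\mathbf{p}\cdot\mathbf{r} + Et)} \right), \tag{6.31}$$

$$\psi_R = \frac{1}{\sqrt{V}} \sum_{\mathbf{p}} \left( b_{\mathbf{p}+}\, |+\rangle\, e^{i(\mathbf{p}\cdot\mathbf{r} - Et)} + d^*_{\mathbf{p}-}\, |+\rangle\, e^{i(-\mathbf{p}\cdot\mathbf{r} + Et)} \right). \tag{6.32}$$

In the limit, $\psi_L$ and $\psi_R$ are completely independent: $\psi_L$ involves only negative
helicity particles and positive helicity antiparticles; $\psi_R$ involves only positive
helicity particles and negative helicity antiparticles.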
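
The identity (6.30) and the approach to the high energy limit are easy to tabulate. The Python sketch below (standard library only, units with $c = 1$, function names illustrative) evaluates both sides of (6.30) from $E$ and $m$, using $\cosh\theta = E/m$ for the left-hand side and $v = p/E$ for the right-hand side.

```python
import math


def expansion_coefficients(E, m):
    """Left-hand side of (6.30): sqrt(m/2E) * exp(+theta/2) and sqrt(m/2E) * exp(-theta/2)."""
    theta = math.acosh(E / m)            # rapidity, from cosh(theta) = E/m
    pref = math.sqrt(m / (2 * E))
    return pref * math.exp(theta / 2), pref * math.exp(-theta / 2)


def velocity_form(E, m):
    """Right-hand side of (6.30): sqrt((1 + v)/2) and sqrt((1 - v)/2) with v = p/E."""
    v = math.sqrt(E * E - m * m) / E
    return math.sqrt((1 + v) / 2), math.sqrt((1 - v) / 2)


if __name__ == "__main__":
    for ratio in (1.0, 1.5, 5.0, 50.0):
        big, small = expansion_coefficients(ratio, 1.0)
        print(f"E/m = {ratio:5.1f}: large = {big:.6f}, small = {small:.6f}")
```

The squares of the two coefficients sum to one for every $E/m$, and the smaller coefficient falls off roughly as $m/2E$, which is why only one helicity component of each of $\psi_L$ and $\psi_R$ survives at high energy.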

Since neutrinos are electrically neutral, they are accessible to experimental investigation
only through the weak interaction and we shall see in Chapter 9 that in the
weak interaction Nature only employs $\psi_L$. In practice neutrino energies are usually
many orders of magnitude greater than their mass, so that only negative helicity
neutrinos and positive helicity antineutrinos are readily observed. It has not so far
been established that the 'hard to see' positive helicity neutrino is different from
the 'easy to see' positive helicity antineutrino.

Problems

6.1 With the normalisation of $\psi_+$ determined by equation (6.14), show that

$$\psi_+^\dagger \psi_+ = \cosh\theta = E/m.$$

(Note that this is not the usual normalisation of particle quantum mechanics.)

Show that the probability of this positive helicity state being in the right-handed
mode is

$$e^{\theta}/(2\cosh\theta) = (1 + v/c)/2$$

and the probability of its being in the left-handed mode is $(1 - v/c)/2$. What are the
corresponding results for $\psi_-$?

6.2 Show that the negative energy positive helicity state of equation (6.18) has probability
$(1 + v/c)/2$ of being in the left-handed mode.

6.3 Show that

$$u_\pm^\dagger(\mathbf{p})\, u_\pm(\mathbf{p}) = v_\pm^\dagger(\mathbf{p})\, v_\pm(\mathbf{p}) = E_p/m,$$

$$u_\pm^\dagger(\mathbf{p})\, u_\mp(\mathbf{p}) = v_\pm^\dagger(\mathbf{p})\, v_\mp(\mathbf{p}) = 0,$$

$$u_\pm^\dagger(\mathbf{p})\, v_\pm(-\mathbf{p}) = v_\pm^\dagger(-\mathbf{p})\, u_\pm(\mathbf{p}) = u_\pm^\dagger(\mathbf{p})\, v_\mp(-\mathbf{p}) = v_\mp^\dagger(-\mathbf{p})\, u_\pm(\mathbf{p}) = 0.$$

These results are useful in Problem 6.4.

6.4 Using the plane wave expansion (6.24) and the energy-momentum tensor components
(6.22) and (6.23), show that the energy and momentum carried by the wave $\psi$ are
given by (6.25) and (6.26).

6.5 Consider a momentum $\mathbf{p}$ in the direction specified by the polar coordinates $\theta$ and $\phi$,

$$\hat{\mathbf{p}} = (\sin\theta\cos\phi,\ \sin\theta\sin\phi,\ \cos\theta).$$

Show that

$$\boldsymbol{\sigma}\cdot\hat{\mathbf{p}} = \begin{pmatrix} \cos\theta & \sin\theta\, e^{-i\phi} \\ \sin\theta\, e^{i\phi} & -\cos\theta \end{pmatrix}$$

and the Pauli spin states

$$|+\rangle = \begin{pmatrix} \cos(\theta/2) \\ \sin(\theta/2)\, e^{i\phi} \end{pmatrix}, \qquad |-\rangle = \begin{pmatrix} -\sin(\theta/2)\, e^{-i\phi} \\ \cos(\theta/2) \end{pmatrix}$$

are the helicity eigenstates appearing in (6.14) and (6.15). An overall phase is
undetermined.

Electrodynamics


In this chapter we set up a Lagrangian for a field theory in which electrically charged 
Dirac particles and antiparticles, for example electrons and positrons, interact with 
and through the electromagnetic field. To facilitate reference to other texts, and 
for conciseness, we work with four-component Dirac spinors and the matrices $\gamma^\mu$
introduced in Section 5.5. 


7.1 Probability density and probability current 

We have seen in previous chapters how conservation laws are associated with 
symmetries of the Lagrangian. The Lagrangian density (5.31), 

$$\mathcal{L} = \bar{\psi}(i\gamma^\mu \partial_\mu - m)\psi,$$

is invariant under the transformation

$$\psi(x) \to \psi'(x) = e^{-i\alpha}\psi(x), \tag{7.1}$$

where $\alpha$ is a constant phase. These transformations form a group $U(1)$ (see
Appendix B) and are said to be global: the same at every point in space and time.

If now we allow an arbitrary small space- and time-dependent variation in
$\alpha$, $\alpha \to \alpha'(x) = \alpha + \delta\alpha(x)$, and if the fields satisfy the field equations, the corresponding
first-order variation $\delta S$ in the action must be zero, since $S$ is stationary
for the actual fields. The variation comes from the operators $\partial_\mu$ acting on $e^{-i\alpha'(x)}$,
so that

$$\begin{aligned} \delta S &= \int \bar{\psi}\gamma^\mu \psi\, (i\partial_\mu e^{-i\delta\alpha})\, d^4x \\ &= \int \bar{\psi}\gamma^\mu \psi\, \partial_\mu(\delta\alpha)\, d^4x, \quad \text{to first order.} \end{aligned}$$

Integrating by parts,

$$\delta S = -\int \partial_\mu(\bar{\psi}\gamma^\mu \psi)\, \delta\alpha\, d^4x.$$

This is zero for any arbitrary function $\delta\alpha(x)$ only if

$$\partial_\mu(\bar{\psi}\gamma^\mu \psi) = 0. \tag{7.2}$$

At each point $x$ of space and time, $\bar{\psi}(x)\gamma^\mu \psi(x)$ transforms like a contravariant
four-vector (Section 5.5) and we may define the contravariant field

$$j^\mu(x) = \bar{\psi}\gamma^\mu \psi = (P(x), \mathbf{j}(x)), \tag{7.3}$$

where $P(x) = \bar{\psi}\gamma^0 \psi = \psi^\dagger (\gamma^0)^2 \psi = \psi^\dagger \psi = \sum_{a=1}^{4} |\psi_a|^2$. Then (7.2) takes the
familiar form

$$\frac{\partial P}{\partial t} + \nabla\cdot\mathbf{j} = 0. \tag{7.4}$$


If $P(x)$ is interpreted as the particle probability density associated with the wave
function $\psi(x)$ and $\mathbf{j}(x)$ as the probability current, (7.4) expresses local particle
conservation. Integrating over all space, and using the divergence theorem, it follows
that for fields that vanish at large distances

$$\frac{d}{dt} \int P\, d^3x = 0.$$

Hence

$$\int P(t, \mathbf{x})\, d^3x = \int \psi^\dagger \psi\, d^3x$$

is a constant independent of time. With $\psi(x)$ taken to be a normalised wave function
for a particle, the constant is unity, and we see that a wave function once normalised
stays normalised. In Chapter 8 we shall see that in a second quantised field theory,
$\int P(t, \mathbf{x})\, d^3x$ is an operator that counts the number of particles minus the number
of antiparticles, and thus this number is conserved.

We could have derived (7.2) from the field equation but the device introduced
here, whereby the conservation law appears as a consequence of the $U(1)$ symmetry
(7.1), is both elegant and economical.


7.2 The Dirac equation with an electromagnetic field 

In classical mechanics, the Hamiltonian for a particle carrying charge q moving in 
an external electromagnetic field specified by the electromagnetic potentials $(\phi, \mathbf{A})$
is obtained from the free particle Hamiltonian by the substitution in (3.8)

$$E \to E - q\phi, \qquad \mathbf{p} \to \mathbf{p} - q\mathbf{A},$$

or, equivalently

$$p^\mu \to p^\mu - qA^\mu, \tag{7.5}$$

where $p^\mu = (E, \mathbf{p})$ is the energy-momentum four-vector of the particle. (See Problems
4.6 and 4.7.) With the quantisation rule $p^\mu \to i\partial^\mu$, (7.5) suggests that the
Dirac equation in the presence of an electromagnetic field should be

$$[\gamma^\mu(i\partial_\mu - qA_\mu) - m]\psi = 0, \tag{7.6}$$

and there should be a corresponding substitution in the Lagrangian density.

Using (4.10) and (5.31), we take the Lagrangian density for the Dirac field
together with the electromagnetic field with external charge-current sources $J^\mu$ to
be

$$\begin{aligned} \mathcal{L} &= \bar{\psi}[\gamma^\mu(i\partial_\mu - qA_\mu) - m]\psi - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} - J^\mu A_\mu \\ &= \bar{\psi}[i\gamma^\mu \partial_\mu - m]\psi - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} - (J^\mu + q\bar{\psi}\gamma^\mu \psi)A_\mu. \end{aligned} \tag{7.7}$$


The Lagrangian is still invariant under the transformation $\psi(x) \to \psi'(x) =
e^{-i\alpha}\psi(x)$ with $\alpha$ constant, and this leads as before to particle conservation:

$$\partial_\mu j^\mu = 0, \qquad j^\mu = \bar{\psi}\gamma^\mu \psi. \tag{7.8}$$

Variation of the fields $A_\mu$ in the action, as in Section 4.2, yields the Maxwell
equations, with charge-current density

$$J^\mu + q\bar{\psi}\gamma^\mu \psi = J^\mu + qj^\mu. \tag{7.9}$$

In (7.8) and (7.9), $j^\mu(x)$ is the conserved particle number density current (antiparticles
being counted as negative), and $qj^\mu(x)$ is the conserved charge density current.
Thus the Lagrangian density (7.7) includes the electromagnetic field produced by
the charged particle current as well as the field produced by external sources.

Setting q = the electron charge = —e, and m to be the electron mass, the 
Lagrangian (7.7) is, after quantisation, the Lagrangian of quantum electrodynamics. 
With the external charge-current distribution $J^\mu(x)$ taken to be that of the atomic
nuclei, and including the dynamics of the nuclei as an assembly of point particles,
this is the basic Lagrangian that describes and explains most of chemistry and
materials science. We shall review some of the astounding successes of quantum
electrodynamics in the next chapter.

7.3 Gauge transformations and symmetry


In Chapter 4 we stressed that the four-potential $A_\mu$ is not unique: the same physical
electric and magnetic fields are obtained after a gauge transformation

$$A_\mu(x) \to A'_\mu(x) = A_\mu(x) + \partial_\mu \chi(x)$$

where $\chi(x)$ is an arbitrary function of space and time.

If $\psi$ is a solution of the Dirac equation with the four-potential $A_\mu$, the corresponding
solution in the gauge with four-potential $A'_\mu$ is given by

$$\psi \to \psi' = e^{-iq\chi}\psi.$$

This is easily verified:

$$\begin{aligned} (i\partial_\mu - qA'_\mu)\psi' &= e^{-iq\chi}\{ i\partial_\mu + q\partial_\mu \chi - q(A_\mu + \partial_\mu \chi) \}\psi \\ &= e^{-iq\chi}(i\partial_\mu - qA_\mu)\psi. \end{aligned}$$

Hence the Dirac equation (7.6) is equivalent to

$$[\gamma^\mu(i\partial_\mu - qA'_\mu) - m]\psi' = 0.$$

The transformations:

$$A_\mu(x) \to A_\mu(x) + \partial_\mu \chi(x) \tag{7.10a}$$

$$\psi(x) \to e^{-iq\chi(x)}\psi(x) \tag{7.10b}$$
make up a general local gauge transformation. 

The charge-current density $qj^\mu = q\bar{\psi}\gamma^\mu \psi$ is invariant under the transformation
and so too is the action provided that (as in Section 4.3) $\partial_\mu J^\mu = 0$. It is also
interesting to note that the phase of a charged Dirac field, for example that of an
electron, is a gauge artefact without physical significance: this phase cannot be
measured.

We can look at this transformation from a different point of view. The Lagrangian
(7.7) is invariant under the global $U(1)$ transformation $\psi \to \psi' = e^{-i\alpha}\psi$ where
$\alpha$ is constant. If we now ask for the Lagrangian to be invariant under a similar
but local transformation, $\psi \to \psi'(x) = e^{-iq\chi(x)}\psi(x)$, where $\chi(x)$ is an arbitrary
function of space and time, we are forced into introducing the gauge field $A_\mu$,
with the transformation property $A_\mu \to A'_\mu = A_\mu + \partial_\mu \chi$, in order to cancel out

the additional terms which arise. 
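
The cancellation can be made explicit term by term. The following worked step is a sketch that uses only the Lagrangian density (7.7) and the transformations (7.10a) and (7.10b); the mass term and $F_{\mu\nu}F^{\mu\nu}$ are unchanged and are omitted.

```latex
\begin{align*}
\bar{\psi}\, i\gamma^\mu \partial_\mu \psi
  &\to \bar{\psi}\, e^{iq\chi}\, i\gamma^\mu \partial_\mu \left( e^{-iq\chi} \psi \right)
   = \bar{\psi}\, i\gamma^\mu \partial_\mu \psi + q\, \bar{\psi}\gamma^\mu \psi\, \partial_\mu \chi, \\
-q\, \bar{\psi}\gamma^\mu \psi\, A_\mu
  &\to -q\, \bar{\psi}\gamma^\mu \psi\, (A_\mu + \partial_\mu \chi)
   = -q\, \bar{\psi}\gamma^\mu \psi\, A_\mu - q\, \bar{\psi}\gamma^\mu \psi\, \partial_\mu \chi .
\end{align*}
```

The extra term $+q\bar{\psi}\gamma^\mu\psi\,\partial_\mu\chi$ generated by the derivative acting on the local phase is cancelled by the term $-q\bar{\psi}\gamma^\mu\psi\,\partial_\mu\chi$ coming from the shift in $A_\mu$. The external source term changes by $-J^\mu \partial_\mu \chi$, which contributes only a total derivative to the action when $\partial_\mu J^\mu = 0$.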

From this point of view, the electromagnetic field appears as a consequence of 
the invariance of the Lagrangian under a local symmetry transformation. This idea 
will be generalised in later chapters.


7.4 Charge conjugation

Charge conjugation is the operation of replacing matter by antimatter so that, for
example, an electron is interpreted as the antiparticle of the positron, which is then
the particle. This would be the natural point of view if the Universe contained antimatter
rather than matter. An interchange is achieved if we replace the Dirac field
by its complex conjugate. Consider a positive energy solution of the field equation
that has a phase factor $e^{-iEt}$. After complex conjugation it has a phase factor $e^{iEt}$,
and with the standard phase convention is a negative energy solution. In the 'hole'
interpretation, negative energy solutions are associated with antiparticles. However,
the operation of complex conjugation does not leave $\mathcal{L}$ invariant: additional
manipulations are needed to display the symmetry.

Taking the complex conjugate of the Dirac equation (7.6) gives

$$[\gamma^{\mu *}(-i\partial_\mu - qA_\mu) - m]\psi^* = 0.$$

Now in the chiral representation $\gamma^0$, $\gamma^1$ and $\gamma^3$ are real and $(\gamma^2)^* = -\gamma^2$. Multiplying
the equation above by $\gamma^2$ and using the anticommuting properties of the $\gamma$
matrices gives

$$[\gamma^\mu(i\partial_\mu + qA_\mu) - m](\gamma^2 \psi^*) = 0,$$

or

$$[\gamma^\mu(i\partial_\mu - qA^c_\mu) - m](\gamma^2 \psi^*) = 0.$$

Hence if $\psi$ is a positive energy solution of the Dirac equation for a particle carrying
charge $q$, $(\gamma^2 \psi^*)$ is a negative energy solution in the charge conjugate field
$A^c_\mu = -A_\mu$, which we introduced in Section 4.6.

There is some freedom of choice in the details of the transformation. We shall 
define the charge conjugate field $\psi^c$ by

$$\psi^c = -i\gamma^2 \psi^* \tag{7.11a}$$

or, in terms of two-component spinors

$$\psi^c_L = -i\sigma^2 \psi_R^*, \qquad \psi^c_R = i\sigma^2 \psi_L^*. \tag{7.11b}$$

Using $(\gamma^2)^2 = -I$, $(\gamma^2)^* = -\gamma^2$, we can invert the transformation (7.11a),
obtaining

$$\psi = -i\gamma^2 (\psi^c)^* \tag{7.12a}$$

or

$$\psi_L = -i\sigma^2 (\psi^c_R)^*, \qquad \psi_R = i\sigma^2 (\psi^c_L)^*. \tag{7.12b}$$

Then (noting $(\gamma^2)^\dagger = -\gamma^2$) we have

$$\psi^\dagger = -i(\psi^c)^T \gamma^2 \tag{7.13a}$$

or

$$\psi_L^\dagger = i(\psi^c_R)^T \sigma^2, \qquad \psi_R^\dagger = -i(\psi^c_L)^T \sigma^2. \tag{7.13b}$$

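Let us see how the various terms in the Lagrangian density (7.7) transform. Consider

$$\bar{\psi}\psi = \psi^\dagger \gamma^0 \psi = -(\psi^c)^T \gamma^2 \gamma^0 \gamma^2 (\psi^c)^* = -(\psi^c)^T \gamma^0 (\psi^c)^*,$$

(using the properties of the $\gamma$-matrices).

To display the invariance of $\mathcal{L}$ we must anticipate Chapter 8. As operators,
spinor fields anticommute: if a product of two fields is interchanged, a minus
sign is introduced. For example, $\psi_a^* \psi_b = -\psi_b \psi_a^*$. Thus in transposing the last
expression above we introduce a minus sign, and hence recover the form of the
original term:

$$\bar{\psi}\psi = \overline{\psi^c}\, \psi^c$$

(since $(\gamma^0)^T = \gamma^0$).

Other terms likewise acquire a minus sign:

$$\begin{aligned} \bar{\psi}\gamma^\mu \psi &= -(\psi^c)^T \gamma^2 \gamma^0 \gamma^\mu \gamma^2 (\psi^c)^* \\ &= (\psi^c)^\dagger (\gamma^2 \gamma^0 \gamma^\mu \gamma^2)^T (\psi^c). \end{aligned}$$

But, as the reader may verify,

$$(\gamma^2 \gamma^0 \gamma^\mu \gamma^2)^T = -\gamma^0 \gamma^\mu.$$

Hence

$$\bar{\psi}\gamma^\mu \psi = -\overline{\psi^c}\, \gamma^\mu\, \psi^c.$$

Finally,

$$\begin{aligned} \bar{\psi}\gamma^\mu i\partial_\mu \psi &= -(\psi^c)^T \gamma^2 \gamma^0 \gamma^\mu \gamma^2\, i\partial_\mu (\psi^c)^* \\ &= \left( i\partial_\mu (\psi^c)^\dagger \right) (\gamma^2 \gamma^0 \gamma^\mu \gamma^2)^T (\psi^c) \\ &= -i(\partial_\mu \overline{\psi^c})\, \gamma^\mu\, \psi^c. \end{aligned}$$

Integration by parts in the action allows us to replace this last term by $\overline{\psi^c}\, \gamma^\mu i\partial_\mu\, \psi^c$
in the Lagrangian density.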
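
The matrix identities used in this section can be verified numerically. The Python sketch below (NumPy, illustrative names, chiral-representation $\gamma$ matrices assumed as in Section 5.5) checks that (7.11a) and (7.11b) agree, that applying the charge conjugation map twice returns the original spinor, that $(\gamma^2\gamma^0\gamma^\mu\gamma^2)^T = -\gamma^0\gamma^\mu$, and that the conjugate of a positive energy spinor satisfies the negative energy condition $(\gamma^0 E - \gamma^3 p)v = -mv$.

```python
import numpy as np

s0 = np.eye(2, dtype=complex)
s = [np.array([[0, 1], [1, 0]], dtype=complex),
     np.array([[0, -1j], [1j, 0]], dtype=complex),
     np.array([[1, 0], [0, -1]], dtype=complex)]
Z = np.zeros((2, 2), dtype=complex)

# Chiral-representation gamma matrices (assumed form, as in Section 5.5)
gamma = [np.block([[Z, s0], [s0, Z]])] + [np.block([[Z, si], [-si, Z]]) for si in s]


def charge_conjugate(psi):
    """psi^c = -i gamma^2 psi^*, equation (7.11a)."""
    return -1j * gamma[2] @ psi.conj()


def charge_conjugate_two_component(psi):
    """psi^c_L = -i sigma^2 psi_R^*, psi^c_R = i sigma^2 psi_L^*, equation (7.11b)."""
    L, R = psi[:2], psi[2:]
    return np.concatenate([-1j * s[1] @ R.conj(), 1j * s[1] @ L.conj()])


def transpose_identity_holds():
    """Check (gamma^2 gamma^0 gamma^mu gamma^2)^T = -gamma^0 gamma^mu for all mu."""
    g0, g2 = gamma[0], gamma[2]
    return all(np.allclose((g2 @ g0 @ gamma[mu] @ g2).T, -g0 @ gamma[mu])
               for mu in range(4))


def conjugate_is_negative_energy(theta, m=1.0):
    """Conjugate of the positive helicity spinor of (6.12) obeys (g0 E - g3 p) v = -m v."""
    E, p = m * np.cosh(theta), m * np.sinh(theta)
    u = np.array([np.exp(-theta / 2), 0, np.exp(theta / 2), 0], dtype=complex) / np.sqrt(2)
    v = charge_conjugate(u)
    return bool(np.allclose((gamma[0] * E - gamma[3] * p) @ v, -m * v))
```

The last check is the spinor counterpart of the statement that $\gamma^2\psi^*$ is a negative energy solution when $\psi$ is a positive energy one; the sign change of $A_\mu$ plays no role there because the sketch treats the free case.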

The Lagrangian can be seen to be of exactly the same form after charge conjugation,
provided that the charge conjugate potentials $A^c_\mu$ are defined to be $A^c_\mu = -A_\mu$
(as in Section 4.6) and any external charge-current density $J^\mu$ also changes sign. In
ordinary matter, where the Dirac particles are electrons, the external $J^\mu$ arise from
the atomic nuclei, and these currents also change sign under charge conjugation.


7.5 The electrodynamics of a charged scalar field 

In Section 3.5 we introduced the Klein-Gordon equation,

$$-\partial_\mu\partial^\mu\phi - m^2\phi = 0,$$

which describes the motion of an uncharged scalar particle. The corresponding
equation for a charged scalar particle is obtained from the Klein-Gordon equation
by making the substitution (7.5), $i\partial_\mu \to i\partial_\mu - qA_\mu$, which gives

$$[(i\partial_\mu - qA_\mu)(i\partial^\mu - qA^\mu) - m^2]\Phi = 0. \tag{7.14}$$

A solution of (7.14) is necessarily complex. Thus a charged particle of zero spin
in an electromagnetic field must be described by a complex, or two-component,
wave function $\Phi = (\phi_1 + i\phi_2)/\sqrt{2}$. We introduced complex scalar fields in Section
3.7. A real Lagrangian density that yields (7.14) and is Lorentz invariant is

$$\mathcal{L} = -[(i\partial_\mu + qA_\mu)\Phi^*][(i\partial^\mu - qA^\mu)\Phi] - m^2\Phi^*\Phi. \tag{7.15}$$

$\mathcal{L}$ is invariant under a local gauge transformation, $\Phi \to e^{-iq\chi}\Phi$. Note that, since
zero spin particles are bosons, the fields $\Phi$ and $\Phi^*$ commute.

Taking the complex conjugate of equation (7.14), we see that if $\Phi(x)$ is a solution
for a particle carrying charge $q$ in a given external field, then $\Phi^*(x)$ is a solution
for a particle carrying a charge $-q$. We define the field $\Phi^c(x) = \Phi^*(x)$ to be the
charge conjugate of $\Phi$. The Lagrangian density (7.15) is invariant under charge
conjugation, $\Phi \to \Phi^c$, if the charge conjugate potentials are again defined to be

$$A^c_\mu = -A_\mu.$$

The charged $\pi^+$ and $\pi^-$ mesons are composite, spin zero particles whose overall
motion is described by the generalised Klein-Gordon equation (7.14). We shall
meet these particles and the fields $\Phi$ and $\Phi^*$ in the phenomenological discussions
of Chapter 9.


7.6 Particles at low energies and the Dirac magnetic moment 

In an electromagnetic field, the coupled Dirac equations (5.10) become

$$(i\partial_0 - qA_0)\psi_L - \sigma^i(i\partial_i - qA_i)\psi_L - m\psi_R = 0,$$

$$(i\partial_0 - qA_0)\psi_R + \sigma^i(i\partial_i - qA_i)\psi_R - m\psi_L = 0, \tag{7.16}$$

where the $\sigma^i$ are the Pauli spin matrices.

From Section 6.1, solutions of the Dirac equation that correspond to particles
at low energies have $\psi_L \approx \psi_R$. We shall now show that at low energies the
two-component wave function

$$\phi = e^{imt}(\psi_L + \psi_R) \tag{7.17a}$$

corresponds closely to the Schrödinger wave function for the particle. The factor
$e^{imt}$ has been inserted so that, as in the Schrödinger equation, the rest mass energy
of the particle is omitted. If we define the orthogonal combination

$$\chi = e^{imt}(\psi_L - \psi_R), \tag{7.17b}$$

then by adding and subtracting the equations (7.16) we obtain an equivalent pair of
equations:

$$(i\partial_0 - qA_0)\phi - \sigma^i(i\partial_i - qA_i)\chi = 0,$$

$$(i\partial_0 - qA_0 + 2m)\chi - \sigma^i(i\partial_i - qA_i)\phi = 0. \tag{7.18}$$

The Schrödinger equation results if the term $(i\partial_0 - qA_0 + 2m)\chi$ is replaced by
$2m\chi$. This approximation is reasonable if the Coulomb potential energy $qA_0$ and
the kinetic energy are small compared with the rest mass of the particle. Then

$$\chi = (1/2m)\,\sigma^i(i\partial_i - qA_i)\phi,$$


and by substitution

$$i\frac{\partial\phi}{\partial t} = \left[\frac{1}{2m}\sigma^i(i\partial_i - qA_i)\sigma^j(i\partial_j - qA_j) + qA_0\right]\phi. \tag{7.19}$$


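The Pauli spin matrices have the property

$$\sigma^i\sigma^j = i\varepsilon_{ijk}\sigma^k + \delta_{ij},$$

and from the antisymmetry of $\varepsilon_{ijk}$,

$$\varepsilon_{ijk}\partial_i\partial_j\phi = 0, \qquad \varepsilon_{ijk}A_iA_j = 0.$$

Also $\varepsilon_{ijk}[\partial_i(A_j\phi) + A_i\partial_j\phi] = \varepsilon_{ijk}[\partial_i(A_j\phi) - A_j\partial_i\phi] = \varepsilon_{ijk}(\partial_iA_j)\phi$, and
recalling $A_\mu = (A_0, -\mathbf{A})$, $\varepsilon_{ijk}(\partial_iA_j) = -(\nabla\times\mathbf{A})_k = -B_k$ gives the magnetic
field $\mathbf{B}$. Using these results, we write (7.19) as

$$i\frac{\partial\phi}{\partial t} = \left[\frac{1}{2m}(-i\nabla - q\mathbf{A})^2 + qA_0 - \left(\frac{q}{2m}\right)\boldsymbol{\sigma}\cdot\mathbf{B}\right]\phi. \tag{7.20}$$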


Without the term $-(q\boldsymbol{\sigma}/2m)\cdot\mathbf{B}$, this would be the Schrödinger equation for a
charged particle in an electromagnetic field. The additional term we interpret as the
energy in a magnetic field of an intrinsic magnetic moment associated with a Dirac
particle. This is another remarkable consequence of the Dirac equation. For an
electron, with $q = -e$, the magnetic moment is the Bohr magneton $\mu_B = e\hbar/2m$,
anti-aligned with the electron spin. The observed magnetic moment agrees to better
than 1% (cf. Section 8.5).

At the level of approximation of (7.20), the magnetic moment would play no
role in a purely electrostatic field $A_0$. In better approximations, or indeed on solving
the Dirac equation directly, ‘spin-orbit coupling’ terms appear, which are of some
importance in atomic physics and materials science.
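
The step from (7.19) to (7.20) rests on the Pauli-matrix product rule $\sigma^i\sigma^j = i\varepsilon_{ijk}\sigma^k + \delta_{ij}$. As a hedged illustration (the function names are ours, and indices run over 0, 1, 2 rather than 1, 2, 3), the rule can be checked for all nine index pairs:

```python
# Sketch: check sigma^i sigma^j = i eps_ijk sigma^k + delta_ij for all i, j.
# Indices 0, 1, 2 stand for the spatial directions 1, 2, 3.

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def matmul2(a, b):
    """Multiply two 2x2 matrices given as lists of rows."""
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]

def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices taken from 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

def pauli_identity_holds(i, j, tol=1e-12):
    """Compare sigma^i sigma^j with delta_ij * 1 + i eps_ijk sigma^k."""
    lhs = matmul2(SIGMA[i], SIGMA[j])
    for r in range(2):
        for c in range(2):
            rhs = 1 if (i == j and r == c) else 0
            rhs += sum(1j * levi_civita(i, j, k) * SIGMA[k][r][c]
                       for k in range(3))
            if abs(lhs[r][c] - rhs) > tol:
                return False
    return True

print(all(pauli_identity_holds(i, j) for i in range(3) for j in range(3)))
```

The symmetric part ($\delta_{ij}$) produces the kinetic term $(-i\nabla - q\mathbf{A})^2$ of (7.20), and the antisymmetric part ($\varepsilon_{ijk}$) produces the $\boldsymbol{\sigma}\cdot\mathbf{B}$ term.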


Problems 

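7.1 Using the plane wave expansion (6.24), show that the conserved particle number can
be written

$$\int P(x^0, \mathbf{x})\,d^3\mathbf{x} = \int\psi^\dagger\psi\,d^3\mathbf{x} = \sum_{\mathbf{p},\varepsilon}\left(b^*_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} + d_{\mathbf{p}\varepsilon}d^*_{\mathbf{p}\varepsilon}\right).$$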

7.2 Show that the charge conjugation operation acting on the positive energy solutions
(6.12) and (6.13) yields the negative energy solutions (6.17).

7.3 Show that, taking the fields to be anticommuting and neglecting the neutrino mass, 
the neutrino Lagrangian density 

£ = 

is invariant under the combined operations of parity and charge conjugation. (Note 
equations (5.26) and (5.27).) 

7.4 Show that $i\sigma^2\psi_R^*$ transforms like a left-handed spinor under a Lorentz transformation.

7.5 Obtain the Klein-Gordon equation (7.14) from the Lagrangian density (7.15). 

7.6 Using the method of Section 7.1, show that the global $U(1)$ symmetry $\Phi \to e^{i\alpha}\Phi$ of
the Lagrangian density (7.15) leads to a conserved charge density current

$$qj^\mu = iq[\Phi^*(\partial^\mu\Phi) - (\partial^\mu\Phi^*)\Phi] - 2q^2A^\mu\Phi^*\Phi.$$

(Note that, in contrast to the result (7.9) for the Dirac Lagrangian, the current of a
complex scalar field contains a term proportional to $A^\mu$.)

7.7 Show that for the positive energy solutions (6.12) and (6.13) of the Dirac equation,

$$qj^\mu = -e\bar\psi\gamma^\mu\psi = -e(\cosh\theta, 0, 0, \sinh\theta) = -(eE/m)(1, 0, 0, v)$$

and also for the ‘negative energy’ solutions (6.17),

$$qj^\mu = -(eE/m)(1, 0, 0, v).$$

With Dirac’s interpretation, the hole that remains when this state is removed from
the sea corresponds to a particle carrying charge $e$ moving with velocity $v$ along the
$z$-axis.


7.8 Show that after the operation of charge conjugation a proton has negative charge and 
an electron has positive charge. 

7.9 How do the electromagnetic potentials transform under the operation of time reversal,
$t \to t' = -t$? Show that $\gamma^1\gamma^3\psi^*(t)$ is a solution of the time reversed Dirac equation,
if $\psi(t)$ is a solution of the Dirac equation.


7.10 Show that, for a Dirac particle in a magnetic field $\mathbf{B}$ given by the vector potential $\mathbf{A}$,
both $\psi_L$ and $\psi_R$ satisfy the equation

$$\left[-\frac{\partial^2}{\partial t^2} - (-i\nabla - q\mathbf{A})^2 - m^2 + q\boldsymbol{\sigma}\cdot\mathbf{B}\right]\psi = 0.$$

Note that this differs from the Klein-Gordon equation for a charged scalar particle
in a magnetic field, by the additional term $q\boldsymbol{\sigma}\cdot\mathbf{B}$.


7.11 Using the parity transformations (4.18) and (5.27), show that the Lagrangian density 
(7.7) is invariant under space inversion. 

Quantising fields: QED


We turn now to the quantisation of the electrodynamic fields introduced in
Chapter 7. So far we have treated the electromagnetic field and the Dirac field
as classical fields (though we were compelled in Chapter 7 to recognise that Dirac
fields anticommute). On quantisation, these fields become operator fields, acting
on the states of a system. The classical total field energy becomes the Hamiltonian
operator, which determines the dynamics of the system. We shall use the formalism
of annihilation and creation operators; this formalism is reviewed briefly in
Appendix C for readers not already familiar with it.

Quantum electrodynamics, or QED, is an important component of the Standard
Model. It is also the foundation of our understanding of the material world at the
atomic level. However, we do not wish to enter into the technical complications
of electrons in atoms or in material media. In this chapter we shall consider only
simpler situations of a few interacting photons, electrons and positrons,
at energies sufficiently high for bound systems of electrons and positrons to be
ignored. In these situations, the free field approximation to QED provides a sound
basis for understanding the interactions of particles as perturbations on their free
behaviour.

This is not a text on quantum field theory, and our outline of perturbation theory 
in this chapter is necessarily sketchy. But our intention is to try to give some insight 
into how the results of calculations, presented in later chapters, are arrived at. We 
shall attempt to explain the necessity of renormalisation, which is an important 
concept in the formulation of the Standard Model. 


8.1 Boson and fermion field quantisation 

The simplest classical field we have introduced is that of a massive free scalar
particle. It satisfies the Klein-Gordon equation (3.19). In the field expansion (3.21)
we have so far regarded the classical wave amplitudes $a_{\mathbf{k}}$ and $a^*_{\mathbf{k}}$ as ordinary complex
numbers. We now quantise the theory. We interpret $a_{\mathbf{k}}$ as an annihilation operator
and $a^*_{\mathbf{k}}$ becomes the creation operator $a^\dagger_{\mathbf{k}}$, the Hermitian conjugate of $a_{\mathbf{k}}$. These
operators are to obey the commutation relations

$$[a_{\mathbf{k}}, a^\dagger_{\mathbf{k}'}] = \delta_{\mathbf{k}\mathbf{k}'}, \qquad [a_{\mathbf{k}}, a_{\mathbf{k}'}] = 0, \qquad [a^\dagger_{\mathbf{k}}, a^\dagger_{\mathbf{k}'}] = 0. \tag{8.1}$$

The total field energy (3.30) becomes the Hamiltonian operator

$$H = \sum_{\mathbf{k}} a^\dagger_{\mathbf{k}}a_{\mathbf{k}}\,\omega_{\mathbf{k}} = \sum_{\mathbf{k}} N_{\mathbf{k}}\,\omega_{\mathbf{k}}, \tag{8.2}$$

where $\omega_{\mathbf{k}} = \sqrt{\mathbf{k}^2 + m^2}$ and it follows from the commutation relations that
$N_{\mathbf{k}} = a^\dagger_{\mathbf{k}}a_{\mathbf{k}}$ is the number operator (Appendix C). As in Chapter 3, we shall in this chapter
confine all particles to a cube of side $l$, volume $V = l^3$, and use periodic boundary
conditions. By defining the Hamiltonian to be of the form (8.2), rather than the
more symmetrical form

$$\frac{1}{2}\sum_{\mathbf{k}}\left(a^\dagger_{\mathbf{k}}a_{\mathbf{k}} + a_{\mathbf{k}}a^\dagger_{\mathbf{k}}\right)\omega_{\mathbf{k}} = \sum_{\mathbf{k}} N_{\mathbf{k}}\,\omega_{\mathbf{k}} + \frac{1}{2}\sum_{\mathbf{k}}\omega_{\mathbf{k}}, \tag{8.3}$$

we discard ‘zero-point energy’ contributions and hence make the energy of the
vacuum state $|0\rangle$ zero. The excited energy eigenstates of the Hamiltonian can
then be interpreted as assemblies of particles ($\pi^0$ mesons, say, or Higgs particles)
with an integer number $n_{\mathbf{k}}$ of particles in the state $\mathbf{k}$, where $n_{\mathbf{k}}$ is the eigenvalue of
the number operator $N_{\mathbf{k}}$. The particles will obey Bose-Einstein statistics.
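
For a single mode, the commutation relation (8.1) and the integer spectrum of $N_{\mathbf{k}}$ can be illustrated with finite matrices. The sketch below is an illustration under stated assumptions, not the construction of Appendix C: it truncates the Fock space at `dim` states, so the relation $[a, a^\dagger] = 1$ necessarily fails in the last basis state. The function names are hypothetical.

```python
import math

# Sketch: one bosonic mode in a Fock space truncated to `dim` states.
# ASSUMPTION: basis |0>, |1>, ..., |dim-1> with a|n> = sqrt(n)|n-1>.

def annihilation(dim):
    """Matrix of the annihilation operator a in the number basis."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)
    return a

def transpose(m):
    """Transpose; for this real matrix it is also the Hermitian conjugate."""
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def number_operator(dim):
    """N = a^dagger a, whose diagonal lists the occupation numbers."""
    a = annihilation(dim)
    return matmul(transpose(a), a)

def commutator_diagonal(dim):
    """Diagonal of [a, a^dagger]; equal to 1 except in the truncated last state."""
    a = annihilation(dim)
    a_dag = transpose(a)
    a_adag = matmul(a, a_dag)
    adag_a = matmul(a_dag, a)
    return [a_adag[n][n] - adag_a[n][n] for n in range(dim)]

DIM = 6
print("eigenvalues of N:", [number_operator(DIM)[n][n] for n in range(DIM)])
print("diag [a, a+]   :", commutator_diagonal(DIM))
```

With the Hamiltonian in the form (8.2), a state with $n_{\mathbf{k}}$ quanta in this mode has energy $n_{\mathbf{k}}\omega_{\mathbf{k}}$, with no zero-point contribution.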

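In the radiation gauge of Section 4.1, the electromagnetic field in free space is
quantised in a very similar way to the Klein-Gordon field. The wave amplitudes $a_{\mathbf{k}\alpha}$
and $a^*_{\mathbf{k}\alpha}$ which appear in the expansion (4.15) become the annihilation and creation
operators $a_{\mathbf{k}\alpha}$ and $a^\dagger_{\mathbf{k}\alpha}$, and the total field energy (4.25) becomes the Hamiltonian
operator

$$H_{\rm em} = \sum_{\mathbf{k},\alpha} a^\dagger_{\mathbf{k}\alpha}a_{\mathbf{k}\alpha}\,\omega_{\mathbf{k}} \tag{8.4}$$

where $\omega_{\mathbf{k}} = |\mathbf{k}|$. The operators $a_{\mathbf{k}\alpha}$ and $a^\dagger_{\mathbf{k}\alpha}$ annihilate and create photons of wave
vector $\mathbf{k}$ and polarisation $\alpha$, and satisfy commutation relations

$$[a_{\mathbf{k}\alpha}, a^\dagger_{\mathbf{k}'\alpha'}] = \delta_{\mathbf{k}\mathbf{k}'}\delta_{\alpha\alpha'}, \qquad [a_{\mathbf{k}\alpha}, a_{\mathbf{k}'\alpha'}] = 0, \qquad [a^\dagger_{\mathbf{k}\alpha}, a^\dagger_{\mathbf{k}'\alpha'}] = 0. \tag{8.5}$$

$N(\mathbf{k}, \alpha) = a^\dagger_{\mathbf{k}\alpha}a_{\mathbf{k}\alpha}$ is the number operator. The energy eigenstates of the radiation
field correspond to assemblies of photons. Photons, like scalar particles, obey
Bose-Einstein statistics. (See Problem 8.1.)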

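On quantising the Dirac field of a free electron, the wave amplitudes appearing in
the expansion (6.24), and their complex conjugates, likewise become operators: $b_{\mathbf{p}\varepsilon}$
and $b^\dagger_{\mathbf{p}\varepsilon}$ annihilate and create electrons of momentum $\mathbf{p}$, helicity $\varepsilon$; $d_{\mathbf{p}\varepsilon}$ and $d^\dagger_{\mathbf{p}\varepsilon}$
annihilate and create positrons of momentum $\mathbf{p}$, helicity $\varepsilon$. Electrons and positrons
are fermions, and these operators obey anticommutation relations, for example

$$b_{\mathbf{p}\varepsilon}b^\dagger_{\mathbf{p}'\varepsilon'} + b^\dagger_{\mathbf{p}'\varepsilon'}b_{\mathbf{p}\varepsilon} = \{b_{\mathbf{p}\varepsilon}, b^\dagger_{\mathbf{p}'\varepsilon'}\} = \delta_{\mathbf{p}\mathbf{p}'}\delta_{\varepsilon\varepsilon'}, \qquad \{b_{\mathbf{p}\varepsilon}, b_{\mathbf{p}'\varepsilon'}\} = 0, \qquad \{b^\dagger_{\mathbf{p}\varepsilon}, b^\dagger_{\mathbf{p}'\varepsilon'}\} = 0. \tag{8.6}$$

$d_{\mathbf{p}\varepsilon}$ and $d^\dagger_{\mathbf{p}\varepsilon}$ obey similar rules. Also all electron operators anticommute with
all positron operators. The electron number operator $N_e(\mathbf{p}, \varepsilon) = b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon}$ and the
positron number operator $N_p(\mathbf{p}, \varepsilon) = d^\dagger_{\mathbf{p}\varepsilon}d_{\mathbf{p}\varepsilon}$ have possible eigenvalues restricted
to 0 and 1, in accord with the Pauli exclusion principle (Appendix C). Electrons
and positrons obey Fermi-Dirac statistics. (See Problem 8.2.)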
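
The restriction of the fermion occupation number to 0 and 1 can be seen in the smallest possible example. The following sketch represents a single mode by $2\times 2$ matrices; the basis ordering $(|0\rangle, |1\rangle)$ and the names `B`, `B_DAG` are illustrative assumptions.

```python
# Sketch: one fermionic mode represented by 2x2 matrices.
# ASSUMPTION: basis ordered as (|0>, |1>), so that B|1> = |0> and B|0> = 0.

B = [[0, 1], [0, 0]]        # annihilation operator
B_DAG = [[0, 0], [1, 0]]    # creation operator (Hermitian conjugate of B)

def matmul2(a, b):
    """Multiply two 2x2 matrices given as lists of rows."""
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]

def add2(a, b):
    return [[a[r][c] + b[r][c] for c in range(2)] for r in range(2)]

ANTICOMMUTATOR = add2(matmul2(B, B_DAG), matmul2(B_DAG, B))  # {b, b+}
B_SQUARED = matmul2(B, B)                                    # b b
NUMBER = matmul2(B_DAG, B)                                   # N = b+ b

print("{b, b+} =", ANTICOMMUTATOR)
print("b b     =", B_SQUARED)
print("N       =", NUMBER)
```

Since $b^2 = 0$, a second particle cannot be added to an occupied state, which is the Pauli exclusion principle in operator form.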

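After second quantisation, the difficulties that were associated with the interpretation
of the Dirac equation as a single particle wave equation disappear. Electrons
and positrons are now on a similar footing and the ‘sea’ of filled negative
energy states is no longer needed. The total field energy (6.25) becomes the
Hamiltonian

$$H = \sum_{\mathbf{p},\varepsilon}\left(b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} - d_{\mathbf{p}\varepsilon}d^\dagger_{\mathbf{p}\varepsilon}\right)E_{\mathbf{p}}.$$

Using an anticommutation relation, we can replace this by

$$H = \sum_{\mathbf{p},\varepsilon}\left(b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} + d^\dagger_{\mathbf{p}\varepsilon}d_{\mathbf{p}\varepsilon} - 1\right)E_{\mathbf{p}}.$$

We shall discard the constant zero-point energy term (which we note is negative
for fermions) and take

$$H = \sum_{\mathbf{p},\varepsilon}\left(b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} + d^\dagger_{\mathbf{p}\varepsilon}d_{\mathbf{p}\varepsilon}\right)E_{\mathbf{p}}. \tag{8.7}$$

The energy of the vacuum state is then zero, and the excited states correspond to
assemblies of electrons and positrons.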

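Similarly, the field momentum (6.26) becomes the momentum operator

$$\mathbf{P} = \sum_{\mathbf{p},\varepsilon}\left(b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} + d^\dagger_{\mathbf{p}\varepsilon}d_{\mathbf{p}\varepsilon}\right)\mathbf{p}. \tag{8.8}$$

The conserved particle number (Problem 7.1) becomes the time independent
operator

$$\int P(x^0, \mathbf{x})\,d^3\mathbf{x} = \sum_{\mathbf{p},\varepsilon}\left(b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} + d_{\mathbf{p}\varepsilon}d^\dagger_{\mathbf{p}\varepsilon}\right),$$

which, using $d_{\mathbf{p}\varepsilon}d^\dagger_{\mathbf{p}\varepsilon} = 1 - d^\dagger_{\mathbf{p}\varepsilon}d_{\mathbf{p}\varepsilon}$ and discarding the constant, we replace by:

$$\text{conserved number operator} = \sum_{\mathbf{p},\varepsilon}\left(b^\dagger_{\mathbf{p}\varepsilon}b_{\mathbf{p}\varepsilon} - d^\dagger_{\mathbf{p}\varepsilon}d_{\mathbf{p}\varepsilon}\right). \tag{8.9}$$

This operator counts the number of electrons minus the number of positrons, a
number which is therefore constant in quantum electrodynamics.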


8.2 Time dependence 

In the Schrödinger picture, a system described by a Hamiltonian $H$ evolves in time
from a state $|t_0\rangle$ at time $t_0$ to a state $|t\rangle$ at time $t$, where

$$|t\rangle = e^{-iH(t - t_0)}|t_0\rangle.$$

Thus time displacements are generated by the unitary operator $e^{-iHt}$.

The expectation value of a time independent operator $\hat O$ at time $t$ is

$$\langle t|\hat O|t\rangle = \langle t_0|e^{iH(t - t_0)}\,\hat O\,e^{-iH(t - t_0)}|t_0\rangle = \langle t_0|\hat O_H(t - t_0)|t_0\rangle$$

where

$$\hat O_H(t) = e^{iHt}\,\hat O\,e^{-iHt} \tag{8.10}$$

depends on $t$.

These last equations give the so-called Heisenberg picture, in which the states
of a system remain fixed and the operators become time dependent. In the case of
free fields, the time dependence of the annihilation and creation operators is very
simple. For example, in the case of a scalar field (see (3.21)),

$$a_{\mathbf{k}}(t) = e^{-i\omega_{\mathbf{k}}t}a_{\mathbf{k}}, \qquad a^\dagger_{\mathbf{k}}(t) = e^{i\omega_{\mathbf{k}}t}a^\dagger_{\mathbf{k}}, \tag{8.11}$$

as may be seen by considering the effect of the operators on a state $|n_{\mathbf{k}}\rangle$ (Appendix
C). It is usual in quantum field theory to work in the Heisenberg picture.
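
The time dependence (8.11) follows from (8.10) because the free Hamiltonian is diagonal in the number basis. A minimal numerical sketch for one mode is given below. It assumes $H = \omega N$ on a Fock space truncated to `dim` states, and the function name is hypothetical.

```python
import cmath
import math

# Sketch: Heisenberg-picture annihilation operator for a single free mode.
# ASSUMPTION: H = omega * N on the truncated basis |0>, ..., |dim-1>, so that
# e^{iHt} is diagonal with entries e^{i n omega t}.

def heisenberg_annihilation(dim, omega, t):
    """Return the matrix of e^{iHt} a e^{-iHt} in the number basis."""
    result = [[0j] * dim for _ in range(dim)]
    for n in range(1, dim):
        phase_left = cmath.exp(1j * (n - 1) * omega * t)   # e^{iHt} acting on <n-1|
        phase_right = cmath.exp(-1j * n * omega * t)       # e^{-iHt} acting on |n>
        result[n - 1][n] = phase_left * math.sqrt(n) * phase_right
    return result

OMEGA, T, DIM = 1.3, 0.7, 5
a_t = heisenberg_annihilation(DIM, OMEGA, T)
expected_phase = cmath.exp(-1j * OMEGA * T)
# Every non-zero element should equal e^{-i omega t} times the element of a.
print([a_t[n - 1][n] / math.sqrt(n) for n in range(1, DIM)])
print(expected_phase)
```

Each matrix element of $a$ connects $|n\rangle$ to $|n-1\rangle$, so the two phases always combine to the same factor $e^{-i\omega t}$, independent of $n$.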

In the case of interacting fields, the basic free field states we have defined are no
longer eigenstates of the total Hamiltonian. In QED we may write

$$H = H_0 + V, \tag{8.12}$$

where

$$H_0 = H(\text{photons}) + H(\text{electrons}) + H(\text{positrons})$$

is given by (8.4) and (8.7). The eigenstates of $H_0$ are just collections of freely
moving photons, electrons, and positrons.

$V$ comes from the term $-q\bar\psi\gamma^\mu\psi A_\mu$ in the Lagrangian density (7.7),
which we constructed in Chapter 7. We are here excluding external fields. Since
$V$ does not depend on derivatives of the fields, its contribution to the energy
density $T^0_0$ is just $q(\bar\psi\gamma^\mu\psi)A_\mu$, and setting $q = -e$ for electrons we obtain
at $t = t_0$

$$V(t_0) = -e\int\bar\psi(\mathbf{r}, t_0)\gamma^\mu\psi(\mathbf{r}, t_0)A_\mu(\mathbf{r}, t_0)\,d^3\mathbf{r}. \tag{8.13}$$

Note that the subsequent time development of the fields is not that of the free fields,
since it is determined by the full Hamiltonian $H = H_0 + V$.

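We can expand the fields $A_\mu$ and $\psi$ at the initial time $t_0$ using (4.15) and (6.24),
replacing the wave amplitudes by appropriate operators. On expanding out $V$ there
will be several types of term. For example, setting $t_0 = 0$ one can easily pick out a
term

$$\frac{-e\,m}{\sqrt{2V\omega_{\mathbf{k}}E_{\mathbf{p}'}E_{\mathbf{p}''}}}\,[\bar u_{\varepsilon'}(\mathbf{p}')\gamma^\mu v_{\varepsilon''}(\mathbf{p}'')\epsilon_\mu]\;b^\dagger_{\mathbf{p}'\varepsilon'}d^\dagger_{\mathbf{p}''\varepsilon''}a_{\mathbf{k}\alpha}\;\delta_{(\mathbf{k} - \mathbf{p}' - \mathbf{p}''),0}. \tag{8.14}$$

This term annihilates a photon and creates an electron-positron pair. The condition
$\mathbf{k} - \mathbf{p}' - \mathbf{p}'' = 0$ comes from the integration over space of the exponential factors,
and explicitly conserves momentum.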

Dynamical calculations in a quantum field theory can be viewed as the calculation
of the unitary operator $e^{-iHt}$ acting on some initial specified state. In QED, the
coupling (8.13) between the radiation field and the Dirac field is determined by the
charge on the electron $e$. It is natural to introduce the dimensionless parameter $\alpha$,
the fine structure constant,

$$\alpha = \frac{e^2}{4\pi\hbar c} \approx \frac{1}{137}.$$

$\alpha$ characterises the strength of the coupling, and is small. Much progress has been
made in QED by the construction of the operator $e^{-iHt}$ as an expansion of the form

$$e^{-iHt} = e^{-iH_0t}\left[1 + e\hat O_1(t) + e^2\hat O_2(t) + \cdots\right] \tag{8.15}$$

where the $\hat O_n(t)$ are time-dependent operators.


8.3 Perturbation theory 

To construct the perturbation expansion (8.15), one can start by considering

$$e^{-iHt} = \left[e^{-iH\delta t}\right]^n \quad\text{with}\quad \delta t = t/n.$$

For large enough $n$ (small enough $\delta t$), one can take

$$e^{-iH\delta t} = 1 - iH\delta t$$

and discard higher order terms in the Taylor expansion. Then

$$e^{-iHt} = \left[1 - i(H_0 + V)\delta t\right]^n.$$
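
How quickly the product of small steps approaches the exponential can be seen in the simplest case, where the Hamiltonian acts on one of its eigenstates and may be replaced by a number $\omega$. The sketch below makes that simplifying assumption; it does not address operator ordering, and the function names are illustrative.

```python
import cmath

# Sketch: compare (1 - i*omega*dt)^n with exp(-i*omega*t) for dt = t/n.
# ASSUMPTION: H is replaced by a single eigenvalue omega, so everything is a
# complex number rather than an operator.

def product_approximation(omega, t, n):
    """n-fold product of the first-order step 1 - i*omega*dt."""
    dt = t / n
    return (1 - 1j * omega * dt) ** n

def approximation_error(omega, t, n):
    """Distance between the product form and the exact phase factor."""
    exact = cmath.exp(-1j * omega * t)
    return abs(product_approximation(omega, t, n) - exact)

for steps in (10, 1000, 100000):
    print(steps, approximation_error(1.0, 1.0, steps))
```

The discarded second-order terms contribute an error of order $(\omega t)^2/2n$, which vanishes as $n \to \infty$ at fixed $t$.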


In the lowest order of perturbation theory only the terms linear in $V$ are kept, so
that

$$e^{-iH_0t}\,e\hat O_1(t) = -i\sum_{r=0}^{n-1}\left[1 - iH_0\delta t\right]^{n-1-r}V\delta t\left[1 - iH_0\delta t\right]^r$$

$$= -i\sum_{r=0}^{n-1}e^{-iH_0(t - t')}\,V\delta t\;e^{-iH_0t'}$$

with $t' = r\delta t$ and $n$ large.

In the limit of $\delta t \to 0$, we can replace the sum by an integral, so that

$$e\hat O_1(t) = -i\int_0^t dt'\,e^{iH_0t'}\,V\,e^{-iH_0t'}. \tag{8.16}$$


The operator $e^{-iH_0t'}$ is the simple free field time evolution operator. If we take $V$ to
be given at $t = 0$ by (8.13), we can write

$$\hat O_1(t) = i\int_0^t\!\!\int\bar\psi(\mathbf{r}', t')\gamma^\mu\psi(\mathbf{r}', t')A_\mu(\mathbf{r}', t')\,d^3\mathbf{r}'\,dt' \tag{8.17}$$

where the fields have the time dependence of free unperturbed fields. A term like
(8.14), for example, will have time dependence (see equation (8.11))

$$e^{-i(\omega_{\mathbf{k}} - E_{\mathbf{p}'} - E_{\mathbf{p}''})t'}. \tag{8.18}$$


The evolution of a state from time $-t/2$ in the past to time $t/2$ in the future
corresponds to taking the integral in (8.17) from $-t/2$ to $t/2$. This more symmetrical
form is appropriate to the description of particle scattering processes. For example,
if the initial state at time $-t/2$ consists of a photon in the state $(\mathbf{k}, \alpha)$, the operators in
(8.14) annihilate this photon and create an electron in a state $(\mathbf{p}', \varepsilon')$ and a positron
in the state $(\mathbf{p}'', \varepsilon'')$. Taking the limit $t \to \infty$ in the time factor (8.18) gives

$$\int_{-\infty}^{\infty}e^{-i(\omega_{\mathbf{k}} - E_{\mathbf{p}'} - E_{\mathbf{p}''})t'}\,dt' = 2\pi\delta(\omega_{\mathbf{k}} - E_{\mathbf{p}'} - E_{\mathbf{p}''}).$$

Thus energy conservation, as well as momentum conservation, is explicit. In free
space it is impossible to satisfy both these conservation laws in the case of pair
production from a photon (Problem 8.3), so that first-order perturbation theory
contributes nothing. (In the presence of an external electromagnetic field, for example
the Coulomb field of a nucleus, momentum conservation between electrons and
photons is lost, and pair production is possible if $\omega_{\mathbf{k}} > 2m$.)
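
The impossibility of pair production by a single photon in free space can be made concrete numerically. The sketch below is an illustration, not a proof: it imposes momentum conservation $\mathbf{k} = \mathbf{p}' + \mathbf{p}''$ and evaluates the argument of the energy delta function for randomly chosen momenta. The mass value (0.511, in MeV), the momentum range and the function names are assumptions made for the example.

```python
import math
import random

# Sketch: with k = p' + p'' imposed, evaluate omega_k - E_p' - E_p''.
# ASSUMPTION: natural units (c = 1), electron mass m = 0.511 (MeV),
# momenta drawn uniformly from a cube of side 20 MeV.

def energy_mismatch(p1, p2, m):
    """Return omega_k - E_p' - E_p'' for a massless photon with k = p' + p''."""
    k = [a + b for a, b in zip(p1, p2)]
    omega = math.sqrt(sum(c * c for c in k))
    e1 = math.sqrt(sum(c * c for c in p1) + m * m)
    e2 = math.sqrt(sum(c * c for c in p2) + m * m)
    return omega - e1 - e2

def largest_mismatch(samples=1000, m=0.511, seed=0):
    """Largest (least negative) mismatch found over random momentum pairs."""
    rng = random.Random(seed)
    worst = -float("inf")
    for _ in range(samples):
        p1 = [rng.uniform(-10.0, 10.0) for _ in range(3)]
        p2 = [rng.uniform(-10.0, 10.0) for _ in range(3)]
        worst = max(worst, energy_mismatch(p1, p2, m))
    return worst

print("largest omega - E' - E'' found:", largest_mismatch())
```

The mismatch is always negative, because $E_{\mathbf{p}} > |\mathbf{p}|$ for a massive particle and $|\mathbf{p}' + \mathbf{p}''| \le |\mathbf{p}'| + |\mathbf{p}''|$. The delta function therefore never has support.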

When the first-order transition amplitude at time $t$ does not vanish, we have,
using (8.16),

$$\langle\text{final state}|e\hat O_1(t)|\text{initial state}\rangle = -i\langle f|V(0)|i\rangle\int_{-t/2}^{t/2}e^{-i\Delta Et'}\,dt',$$

where $\Delta E = E_i - E_f$ and $E_i$ and $E_f$ are the energies of the initial state $|i\rangle$ and final
state $|f\rangle$. It is shown in textbooks on quantum mechanics that the time dependence
can be interpreted as a transition probability per unit time, from the initial state $i$ to
the final state $f$, given by

$$\text{transition probability} = 2\pi|\langle f|V(0)|i\rangle|^2\rho(E_f),$$

where $\rho(E_f)$ is the density of final energy states at $E_f = E_i$.


It is straightforward to extract higher order terms of the perturbation expansion.
For example

$$\hat O_2(t) = -\int_{-t/2}^{t/2}d^4x_2\,\bar\psi(x_2)\gamma^\mu\psi(x_2)A_\mu(x_2)\int_{-t/2}^{t_2}d^4x_1\,\bar\psi(x_1)\gamma^\nu\psi(x_1)A_\nu(x_1) \tag{8.19}$$

where $x_1 = (t_1, \mathbf{r}_1)$, $x_2 = (t_2, \mathbf{r}_2)$ and $-t/2 < t_2 < t/2$.


8.4 Renormalisation and renormalisable field theories 

In second-order perturbation theory, we can pick out terms corresponding to the
creation of an electron-positron pair at a point $x_1$ in space-time and its destruction
at a point $x_2$. They may be characterised by the diagrams of Fig. 8.1. In these diagrams
time runs from left to right. Momentum is conserved at $x_1$ and $x_2$. Overall
there is also conservation of energy and angular momentum, so that the ‘unperturbed’
photon that emerges at time $t_2$ is in the same state as the initial unperturbed
photon.

We pointed out that in free space it is not possible to create a real $e^-e^+$ pair from
a photon. The $e^-e^+$ pair of the diagram is a virtual pair, corresponding to a term in
a mathematical expansion. The transition amplitude

$$\langle\mathbf{k}|e^{-iH_0t}\,\hat O_2(t)|\mathbf{k}\rangle = e^{-i\omega_{\mathbf{k}}t}\langle\mathbf{k}|\hat O_2(t)|\mathbf{k}\rangle$$

is non-vanishing. The ‘real’ photon is evidently a complex object. Calculations
show that the effect of virtual $e^-e^+$ pairs is to make the vacuum behave like an
electrically polarisable medium. In particular, the Coulomb interaction between
two ‘bare’ electrons is screened. We can envisage this effect as resulting from a
screening cloud of virtual positrons around each bare electron, the corresponding
negative charge of the virtual $e^-e^+$ pairs appearing as charge at the surface of the
confining volume.

Figure 8.1 In these diagrams an unperturbed electron-positron pair is created at
a point $x_1$ in space-time and destroyed at a point $x_2$. In (a) the initial unperturbed
photon is destroyed at $x_1$ and recreated at $x_2$; vice versa in (b). In (a) and (b)
time runs from left to right. As shown by Feynman it is convenient to characterise
both processes by the single Feynman diagram (c). In all of these diagrams the
arrows on the fermion lines follow the direction of electron number. (The arrows
on positrons then run backwards in time.)

What is measured experimentally as the charge $-e$ on an electron is the screened
charge. To compensate for this screening effect, the parameter $e$ that appears in the
Lagrangian must be replaced by a ‘bare’ charge $e_0 = e + \Delta e$. This gives ‘counter
terms’ in the Lagrangian. $\Delta e$ is chosen to cancel the screening effect. To second
order the calculation gives $\Delta e = \alpha A_1e$ where $A_1$ is a dimensionless quantity. With
this adjustment and to this order, the screened charge on the electron becomes $-e$.
In higher orders of perturbation theory one obtains

$$\Delta e = e[\alpha A_1 + \alpha^2A_2 + \cdots].$$

To any order of perturbation theory an account must be kept of the readjustment
of $e$, in order to extract from a calculation the significant physical effects which
are also determined by terms in the perturbation expansion. The charge $-e$ on
the electron is said to be renormalised. $\Delta e$ itself can never be measured. Physical
effects in atomic physics arising in part from vacuum polarisation terms have been
calculated and measured with high precision. (See also Section 16.3.)

The other parameter appearing in electrodynamics is the mass of the electron.
The bare mass of the electron is modified in second-order perturbation
theory by the processes shown in Fig. 8.2. To compensate for these processes
we must take $m_0 = m - \Delta m$ in the Lagrangian, where $\Delta m$ is chosen to compensate
for the shift in mass produced by the electron-photon interactions. We can
think of the bare electron as ‘dressed’ by virtual photons. It is found that to second
order $\Delta m = \alpha mB_1$, where $B_1$ is another dimensionless quantity, and more
generally

$$\Delta m = m[\alpha B_1 + \alpha^2B_2 + \cdots].$$

As with $\Delta e$, $\Delta m$ has to be adjusted at each higher order of perturbation theory,
and there is a systematic way of extracting physical answers from perturbation
calculations. The physical mass $m$ is the renormalised mass.

Figure 8.2 In these diagrams an unperturbed photon is created at a point $x_1$ in
space-time and destroyed at a point $x_2$. In (a) the initial unperturbed electron is
destroyed at $x_1$ and recreated at $x_2$; vice versa in (b). In (a) and (b) time runs from
left to right. It is convenient to characterise both processes by the single Feynman
diagram (c). In all of these diagrams the arrows on the fermion lines follow the
direction of the electron number. (The arrows on positrons then run backwards in
time.)

Diagrams like those of Fig. 8.3, in which virtual $e^-e^+$ pairs and virtual photons
are created and annihilated together, give terms that modify the vacuum energy.
Energy shifts in perturbation theory are to be expected, but since we have no
unperturbed vacuum with which to compare, such shifts are not measurable. The
cosmological constant of general relativity gives a measure of the vacuum energy
density that is certainly very small, and is consistent with its being zero. We shall
take the vacuum energy density, whatever its origin, to be zero.

Figure 8.3 The vacuum state of quantum electrodynamics differs from the unperturbed
vacuum by processes, one of which is illustrated in this figure.

It could have been anticipated without calculation that there would be perturbing
effects of charge renormalisation and mass renormalisation. The unpalatable feature
of quantum electrodynamics is that when the constants $A_i$ and $B_i$ are calculated
they all turn out to be infinite, as does the correction to the vacuum state energy. It
is just as well that $\Delta e$ and $\Delta m$ have no physical significance. However, it is the case
that an expansion in the small parameter $\alpha$ gives seemingly infinite corrections to
quantities one cannot measure. An important feature of QED is that, leaving aside
a scaling of the fields that is also part of the renormalisation scheme, infinities only
appear in the renormalisation of the parameters of the theory, $e$, $m$ and the vacuum
energy. The only infinite counter terms that have to be added to the Lagrangian
are contained in these parameters. Having made these adjustments, the remaining
physical effects are calculable and finite.

QED is a local field theory, i.e. a theory in which the interaction terms involve a 
product of fields at the same point in space time. Infinities such as occur in QED 
are endemic in all local field theories. Field theories in which the infinities only 
appear in a finite number of parameters of the theory are said to be renormalisable. 

The divergences in the coefficients $A_i$ of $\Delta e$ and $B_i$ of $\Delta m$ arise, for example,
in the contribution from $\hat O_2$ (see (8.19)), from the integration region where $x_2 \approx x_1$
and in particular where $\mathbf{r}_2 \approx \mathbf{r}_1$. An important feature of QED is that the expansion
parameter $\alpha$, and hence the coefficients, are dimensionless numbers. In Chapters 9
and 21 we will encounter theories in which the coupling constants and therefore
the expansion parameters have the dimensions of inverse powers of mass. All
the terms in perturbation expansions must have the same dimension; therefore the
coefficients have a dimension to compensate those of the coupling constant. In the
integration regions the integrands diverge with large inverse powers of $|\mathbf{r}_2 - \mathbf{r}_1|$ as
$\mathbf{r}_2 \to \mathbf{r}_1$ to achieve the compensation, but they render the integrals infinite. Infinities
occur for all multiparticle interactions; they cannot be removed just by mass and
coupling constant renormalisation. Such theories are unrenormalisable, and they
cannot be taken seriously as quantum field theories.


8.5 The magnetic moment of the electron 


We shall now illustrate the remarkable success of QED in calculating quantities
of physical significance by giving an account of the calculation of the electron’s
magnetic moment. In Chapter 7 we showed that the Dirac equation before second
quantisation implies that the electron carries a magnetic moment of magnitude
$\mu_B = e\hbar/2m$ anti-aligned with its spin. The electron’s magnetic moment has been
measured with high precision: the experimental value $\mu_e$ is

$$\mu_e = \mu_B(1 + a)$$

where the ‘anomaly’ $a$ = 0.001 159 652 188 4(43) (Van Dyck et al., 1987).

After second quantisation, the perturbative corrections to the Dirac value can be
calculated. The Dirac value is contained in the operator $\hat O_1$ of equation (8.16), and
is associated with diagram (a) of Fig. 8.4. This lowest order calculation reproduces
the Dirac result $\mu_e = \mu_B$.

Since $\mu_B$ is the only combination of the parameters $e$, $m_e$ and $\hbar$ which has the
dimensions of magnetic moment, higher orders of perturbation theory will give
terms of the form

$$\mu_e = \mu_B(1 + \alpha C_1 + \alpha^2C_2 + \alpha^3C_3 + \alpha^4C_4 + \cdots),$$

where the $C_i$ are dimensionless constants. To compare the theory with experiment
we use the 1986 adjusted value of the fine structure constant,

$$\alpha^{-1} = 137.035\,9979(32).$$

$C_1$ is associated with diagram (b) of Fig. 8.4; the calculation gives $C_1 = 1/(2\pi)$.
Hence to this order

$$a = C_1\alpha = 0.001\,161\,409\,74,$$

which agrees with experiment to within five decimal places.

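The next order correction, associated with diagrams (c) of Fig. 8.4, is

$$C_2 = \frac{1}{\pi^2}\left[\frac{197}{144} + \frac{\pi^2}{12} - \frac{\pi^2}{2}\ln 2 + \frac{3}{4}\zeta(3)\right],$$

where $\zeta(z)$ is the Riemann zeta function. To this order,

$$a = 0.001\,159\,637\,44,$$

in agreement to seven decimal places.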
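
The two numerical values quoted above can be reproduced with a few lines of arithmetic. The sketch below is illustrative. It takes $\alpha^{-1}$ and the experimental anomaly from the text, assumes the standard closed form $C_2 = [197/144 + \pi^2/12 - (\pi^2/2)\ln 2 + \tfrac{3}{4}\zeta(3)]/\pi^2$ for the second-order coefficient, and uses the standard numerical value of $\zeta(3)$. The names `anomaly` and `ZETA3` are ours.

```python
import math

# Sketch: reproduce the first- and second-order values of the anomaly a.
# ASSUMPTIONS: alpha^-1 and the experimental value are those quoted in the
# text; C2 is taken in the standard closed form; zeta(3) is entered as a
# numerical constant.

ALPHA_INV = 137.0359979
A_EXPERIMENT = 0.0011596521884
ZETA3 = 1.2020569031595942

ALPHA = 1.0 / ALPHA_INV
C1 = 1.0 / (2.0 * math.pi)
C2 = (197.0 / 144.0
      + math.pi ** 2 / 12.0
      - (math.pi ** 2 / 2.0) * math.log(2.0)
      + 0.75 * ZETA3) / math.pi ** 2

def anomaly(order):
    """Sum alpha^n * C_n for n = 1 .. order (order is 1 or 2 here)."""
    coefficients = [C1, C2]
    return sum(c * ALPHA ** (n + 1)
               for n, c in enumerate(coefficients[:order]))

for order in (1, 2):
    value = anomaly(order)
    print(order, value, "difference from experiment:", value - A_EXPERIMENT)
```

The second-order term is negative and about a thousand times smaller than the first, which is why each further order adds roughly two or three decimal places of agreement.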

Figure 8.4 Perturbation theory Feynman diagrams that represent contributions to
the electron magnetic moment. The anomalous moment, to order $\alpha^2$, comes from
calculations associated with diagrams (b) and (c).

Calculations of higher orders of perturbation theory become rapidly more
intractable. Numerical estimates give $C_3 \approx 0.03792$, $C_4 \approx -0.014$. At this level
of accuracy, corrections have to be made for processes that come from other parts
of the Standard Model, in particular from the muon. The most recent comprehensive
calculations (Kinoshita and Lindquist, 1990) give

$$a = 0.001\,159\,652\,1400\ (41 + 53 + 271),$$

in agreement with experiment to ten decimal places. The largest error in the
theory is from the uncertainty in $\alpha^{-1}$.

Within its range of applicability, quantum electrodynamics provides an astonishingly
exact model of Nature. One may have some confidence that the techniques
of renormalisation in perturbation theory are valid.


8.6 Quantisation in the Standard Model 

In this chapter we have outlined the ‘canonical quantisation’ techniques that have
been particularly successful in quantum electrodynamics. Many books have been
written on this subject, for example Itzykson and Zuber (1980); some will have to
be consulted if one is to be competent and confident in making detailed calculations.
However, many of the decay rates and cross-sections given in the following
chapters, which are needed to compare the predictions of the Standard Model with
experiment, are quite well approximated by the so-called ‘tree level’ of perturbation
theory. The tree-level diagrams have no closed loops (see Fig. 8.4(a)) and require
no renormalisation. It is a fortunate circumstance that in low orders of perturbation
theory these can be calculated quite easily.

The particles and forces of the weak and the strong interactions are also described 
by local gauge field theories, which will be exhibited at the classical level in the 
chapters that follow. The quantisation procedures used in these extensions of QED 
have been most successfully pursued by the path integral method of quantisation 
(see, for example, Cheng and Li (1984)). Both the theory of the weak interaction 
and the theory of the strong interaction pose their own special problems, but the 
principles of gauge symmetry and renormalisability have been essential in the 
construction of the Standard Model as it is today. 


Problems 

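8.1 A general two-particle state of scalar bosons (Section 8.1) can be written

$$|\text{state}\rangle = \sum_{\mathbf{k}_1,\mathbf{k}_2}f(\mathbf{k}_1, \mathbf{k}_2)\,a^\dagger_{\mathbf{k}_1}a^\dagger_{\mathbf{k}_2}|0\rangle,$$

where, apart from normalisation, $f(\mathbf{k}_1, \mathbf{k}_2)$ is any function of $\mathbf{k}_1$ and $\mathbf{k}_2$. ($f$ can be
called the wave function of the state.)

Show that this state may be written

$$|\text{state}\rangle = \sum_{\mathbf{k}_1,\mathbf{k}_2}g(\mathbf{k}_1, \mathbf{k}_2)\,a^\dagger_{\mathbf{k}_1}a^\dagger_{\mathbf{k}_2}|0\rangle$$

with $g(\mathbf{k}_1, \mathbf{k}_2) = \{f(\mathbf{k}_1, \mathbf{k}_2) + f(\mathbf{k}_2, \mathbf{k}_1)\}/2$, symmetric under the interchange of
labelling.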

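8.2 A general two-particle state of fermions can be written 

$$|\text{state}\rangle = \sum_{\mathbf{p}_1,\varepsilon_1,\mathbf{p}_2,\varepsilon_2} f(\mathbf{p}_1,\varepsilon_1;\mathbf{p}_2,\varepsilon_2)\, b^\dagger_{\mathbf{p}_1\varepsilon_1} b^\dagger_{\mathbf{p}_2\varepsilon_2} |0\rangle$$

where, apart from normalisation, $f$ is any function of $\mathbf{p}_1$, $\varepsilon_1$ and $\mathbf{p}_2$, $\varepsilon_2$. 

Show that this state can also be written 

$$|\text{state}\rangle = \sum_{\mathbf{p}_1,\varepsilon_1,\mathbf{p}_2,\varepsilon_2} g(\mathbf{p}_1,\varepsilon_1;\mathbf{p}_2,\varepsilon_2)\, b^\dagger_{\mathbf{p}_1\varepsilon_1} b^\dagger_{\mathbf{p}_2\varepsilon_2} |0\rangle$$

with $g(\mathbf{p}_1,\varepsilon_1;\mathbf{p}_2,\varepsilon_2) = \{f(\mathbf{p}_1,\varepsilon_1;\mathbf{p}_2,\varepsilon_2) - f(\mathbf{p}_2,\varepsilon_2;\mathbf{p}_1,\varepsilon_1)\}/2$, antisymmetric under 
the interchange of labelling. 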

8.2 Use energy and momentum conservation to show that pair creation by a single photon, 
$\gamma \to e^+ + e^-$, is impossible in free space. 

8.3 The energy density of an electromagnetic field is given by equation (4.24). Show that 
the total electric field energy of a point charge $q$ outside a sphere of radius $R$ centred 
on the particle is 

$$\text{energy} = q^2/(8\pi R).$$

Note that this classical contribution to the particle rest energy is infinite in the limit 
$R \to 0$. 
The weak interaction: low energy phenomenology 


In this chapter we review some of the early phenomenology of the weak interaction 
that played an important guiding role in the construction of the Standard Model. 
The phenomenology discussed is insensitive to the very small effects of neutrino 
mass. These effects will be ignored. 


9.1 Nuclear beta decay 

In early investigations of nuclear physics, the existence of a ‘weak interaction’ 
responsible for nuclear $\beta$ decay was discerned. It was regarded as weak since the 
mean lives of decays such as 

$$^{17}_{\ 9}\mathrm{F} \to {}^{17}_{\ 8}\mathrm{O} + e^+ + \nu_e,$$
$$n \to p + e^- + \bar\nu_e,$$

are very long, minutes in these examples, compared with typical nuclear electromagnetic 
decays, which have a mean life of $\sim 10^{-15}$ s. 

Nuclear physicists have by careful and ingenious experimentation established 
the principal features of the weak interaction and the properties of the electron 
neutrino $\nu_e$. To conserve electric charge the neutrino must be electrically neutral, 
and angular momentum is conserved if it is a Dirac spin $\tfrac{1}{2}$ fermion. If the electron 
neutrino has a mass, it is certainly very small. 

The surprising feature of the weak interaction, which was established experimentally 
in 1957 by Wu following a suggestion by Lee and Yang, is that it does not 
conserve parity. Nature is not ambidextrous. Indeed, parity is maximally violated, 
in that only the left-handed components of both the electron and neutrino fields 
participate in the interaction. 

This phenomenon is clearly illustrated if one examines the longitudinal polarisation 
of electrons produced in ‘allowed’ $\beta$ decays. An electron of negative helicity and 
velocity $v$ is in a left-handed state with probability $\tfrac{1}{2}[1 + (v/c)]$; an electron of 
positive helicity $+\tfrac{1}{2}$ is in a left-handed state with probability $\tfrac{1}{2}[1 - (v/c)]$ 
(Section 6.5). In allowed nuclear $\beta$ decays there are no nuclear factors that favour one 
helicity state over another, so that if only the left-handed component of the electron 
field participates in the interaction, the degree of longitudinal polarisation of the 
emitted electron is 

$$P = -\frac{v}{c}.$$

Figure 9.1 Measured degree of longitudinal polarisation $P$ for allowed $e^-$ decays. 
(Data from Koks and Van Klinken (1976).) 

For positrons, the probabilities are reversed (Section 6.5) and the longitudinal 
polarisation of a positron emitted in an allowed $\beta$ decay is $+v/c$. Data from several such 
decays are shown in Fig. 9.1. 

A direct measurement of the helicities of neutrinos emitted in $\beta$ decay is almost 
impossible, but the helicities may be inferred from careful measurements of the 
angular momentum states of the participating nuclei. Within experimental error, 
only negative helicity neutrinos and positive helicity antineutrinos participate in 
the weak interaction. 

Nuclear $\beta$ decays do not release sufficient energy to produce either of the two 
other lepton families known to exist: muons and muon neutrinos, and tau leptons 
and their partner neutrinos. We shall see in Chapter 13 that probably there are just 
these three, $e$, $\mu$, $\tau$, lepton families. Each family seems to play a similar role in 
Nature, an observation known as lepton universality. They differ only in the masses 
of the electrically charged leptons: $m_e \approx 0.511$ MeV, $m_\mu \approx 106$ MeV, 
$m_\tau = 1777$ MeV. 

Figure 9.2 $\pi^- \to e^- + \bar\nu_e$. In this illustration the electron velocity is to the right, 
the antineutrino to the left, and the spin directions are indicated. Any orbital angular 
momentum is out of the plane of the page ($\mathbf{L} = \mathbf{r} \times \mathbf{p}$) and since the total angular 
momentum must be zero the spins have to be opposite. 


9.2 Pion decay 

An important example that illustrates both the left-handedness of the lepton fields 
participating in $\beta$ decay and lepton universality is provided by the decay of the 
charged pi mesons. These decays are common in the cosmic radiation and provide 
its principal component, muons, at ground level. Almost 100% of the pions decay 
through 

$$\pi^- \to \mu^- + \bar\nu_\mu, \qquad \pi^+ \to \mu^+ + \nu_\mu,$$

with a decay rate $1/\tau(\pi \to \mu\nu_\mu) = 2.53 \times 10^{-14}$ MeV. The corresponding 
decays to electrons have much smaller decay rates: 
$1/\tau(\pi \to e\nu_e) = 1.23 \times 10^{-4}\,(1/\tau(\pi \to \mu\nu_\mu))$. 

The decay rate to electrons is suppressed because only the left-handed fields of 
the electron and neutrino take part. Consider the $\pi^-$ decay in a frame in which 
the pion is at rest (Fig. 9.2). The $\pi^-$ has zero spin, the antineutrino has positive 
helicity. Hence to conserve angular momentum in this two-body decay the electron 
also must have positive helicity. The probability of its being in the left-handed state 
is $\tfrac{1}{2}[1 - (v_e/c)] = m_e^2/(m_\pi^2 + m_e^2) = 1.34 \times 10^{-5}$ (Problem 9.1). The $\mu^-$ decay is 
similarly inhibited, but the muon’s much larger mass makes the factor less effective: 
$\tfrac{1}{2}[1 - (v_\mu/c)] = 0.36$. 
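
The two suppression factors quoted above follow from two-body kinematics alone. The following is a minimal numerical sketch, assuming rounded masses $m_\pi = 139.57$ MeV, $m_e = 0.511$ MeV and $m_\mu = 105.66$ MeV (the pion mass is not quoted in the text); the function names are illustrative only.

```python
# Helicity-suppression factor (1/2)(1 - v/c) for the charged lepton in pi -> l + nu,
# with the pion at rest and a massless neutrino. Masses in MeV are assumed values.
M_PION = 139.57
M_ELECTRON = 0.511
M_MUON = 105.66


def lepton_speed(m_lepton, m_pion=M_PION):
    """Return v/c of the charged lepton from energy and momentum conservation."""
    momentum = (m_pion**2 - m_lepton**2) / (2.0 * m_pion)
    energy = (m_pion**2 + m_lepton**2) / (2.0 * m_pion)
    return momentum / energy


def left_handed_probability(m_lepton, m_pion=M_PION):
    """Probability that a positive-helicity lepton is in the left-handed state."""
    return 0.5 * (1.0 - lepton_speed(m_lepton, m_pion))


if __name__ == "__main__":
    for name, mass in (("electron", M_ELECTRON), ("muon", M_MUON)):
        closed_form = mass**2 / (M_PION**2 + mass**2)
        print(f"{name}: {left_handed_probability(mass):.3e} "
              f"(closed form {closed_form:.3e})")
```

With these inputs the electron factor comes out close to $1.34 \times 10^{-5}$ and the muon factor close to 0.36, in line with the values in the text.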

An effective interaction Lagrangian density that incorporates these features 
is 

$$\mathcal{L}_{\text{int}} = \alpha_\pi \left[ j^\mu \partial_\mu \Phi_\pi + j^{\mu\dagger} \partial_\mu \Phi_\pi^\dagger \right], \tag{9.1}$$

where 

$$j^\mu = e_L^\dagger \tilde\sigma^\mu \nu_{eL} + \mu_L^\dagger \tilde\sigma^\mu \nu_{\mu L} + \tau_L^\dagger \tilde\sigma^\mu \nu_{\tau L}, \tag{9.2}$$

and $\alpha_\pi$ is an effective (real) coupling constant. 

$\Phi_\pi$ is a complex scalar field describing the charged $\pi^\pm$ mesons (Section 7.6). 
$\Phi_\pi$ destroys negative pions, and creates positive pions. It is not a fundamental field 
of the Standard Model, since it ignores the internal structure of the pions. The 
four-vector $e_L^\dagger \tilde\sigma^\mu \nu_{eL}$ is the simplest Lorentz structure we can construct from the 
two left-handed spinor fields, $e_L$, $\nu_{eL}$, belonging to the electron and its neutrino (see 
Problem 5.3). Lepton universality is then incorporated in the model, the three lepton 
families contributing in a similar way to the ‘current’ $j^\mu$; this structure survives 
in the Standard Model. A Lorentz invariant $\mathcal{L}_{\text{int}}$ is obtained by taking the scalar 
product of $j^\mu$ with $\partial_\mu \Phi_\pi$, and, finally, we make $\mathcal{L}_{\text{int}}$ real. Note that $\mathcal{L}_{\text{int}}$ is a ‘point’ 
interaction: $j^\mu$ and $\partial_\mu \Phi_\pi$ are evaluated at the same point $x$ in space-time. Since the 
pion is an extended object, this point interaction must be an approximation, not to 
be taken too seriously. 

An effective interaction Lagrangian is to be used only in low orders of perturbation 
theory. It is not suitable for calculating high order corrections. One should not 
therefore demand high accuracy when comparing the results of a calculation with 
experiment. 

Using our $\mathcal{L}_{\text{int}}$ to lowest order, the partial decay rates for pions at rest are (Problem 
9.4) 

$$\frac{1}{\tau(\pi \to e\nu_e)} = \frac{\alpha_\pi^2}{4\pi}\left(1 - \frac{v_e}{c}\right) p_e^2 E_e, \qquad \frac{1}{\tau(\pi \to \mu\nu_\mu)} = \frac{\alpha_\pi^2}{4\pi}\left(1 - \frac{v_\mu}{c}\right) p_\mu^2 E_\mu. \tag{9.3}$$

In these equations, $E_e$, $E_\mu$ and $p_e$, $p_\mu$ are the charged lepton’s energy and momentum, 
and are determined by energy and momentum conservation. The factors 
$p_e^2 E_e$, $p_\mu^2 E_\mu$ come from the density of states factor in the expression for the transition 
probability (Problem 9.2). The factors $(1 - v_e/c)$ and $(1 - v_\mu/c)$ are a consequence 
of the participation of left-handed fields only. 

The ratio 

$$\frac{\tau(\pi \to \mu\nu_\mu)}{\tau(\pi \to e\nu_e)} = \frac{m_e^2 (m_\pi^2 - m_e^2)^2}{m_\mu^2 (m_\pi^2 - m_\mu^2)^2} = 1.28 \times 10^{-4} \tag{9.4}$$

(Problem 9.3). This lowest order calculation, which neglects the effects of non-locality 
and electromagnetic corrections, agrees well with the experimental value 
of $1.23 \times 10^{-4}$, and gives strong support for lepton universality. 
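
As a rough cross-check of (9.4), the sketch below evaluates the ratio in two ways: from the kinematic factors $(1 - v/c)\,p^2 E$ of (9.3), and from the closed form in the masses. The mass values ($m_\pi = 139.57$ MeV, $m_e = 0.511$ MeV, $m_\mu = 105.66$ MeV) and the function names are assumptions made for illustration.

```python
# Ratio tau(pi -> mu nu) / tau(pi -> e nu) at lowest order, equation (9.4).
# Masses in MeV are assumed, rounded values.
M_PION = 139.57
M_ELECTRON = 0.511
M_MUON = 105.66


def kinematic_factor(m_lepton, m_parent):
    """(1 - v/c) p^2 E for the charged lepton in a two-body decay at rest."""
    momentum = (m_parent**2 - m_lepton**2) / (2.0 * m_parent)
    energy = (m_parent**2 + m_lepton**2) / (2.0 * m_parent)
    return (1.0 - momentum / energy) * momentum**2 * energy


def ratio_from_rates(m_parent=M_PION):
    """Electron rate divided by muon rate; the common factor alpha^2/(4 pi) cancels."""
    return kinematic_factor(M_ELECTRON, m_parent) / kinematic_factor(M_MUON, m_parent)


def ratio_closed_form(m_parent=M_PION):
    """m_e^2 (M^2 - m_e^2)^2 / [m_mu^2 (M^2 - m_mu^2)^2]."""
    num = M_ELECTRON**2 * (m_parent**2 - M_ELECTRON**2) ** 2
    den = M_MUON**2 * (m_parent**2 - M_MUON**2) ** 2
    return num / den


if __name__ == "__main__":
    print(f"from (9.3): {ratio_from_rates():.3e}")
    print(f"closed form: {ratio_closed_form():.3e}")
```

The two evaluations agree because $(1 - v/c)\,p^2 E = m_\ell^2 (m_\pi^2 - m_\ell^2)^2 / (4 m_\pi^3)$ for each lepton $\ell$.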
The observations give $1/\tau(\pi \to e\nu_e) = 3.11 \times 10^{-18}$ MeV, $1/\tau(\pi \to \mu\nu_\mu) = 
2.53 \times 10^{-14}$ MeV, from which we may estimate 

$$\alpha_\pi = 2.09 \times 10^{-9}\ \text{MeV}^{-1}.$$

The smallness of $\alpha_\pi$ reflects the weakness of the weak interaction. 

Although the pion does not have enough mass to decay to tau leptons, the effective 
Lagrangian (9.1) also describes the decays 

$$\tau^+ \to \pi^+ + \bar\nu_\tau, \qquad \tau^- \to \pi^- + \nu_\tau,$$

and in lowest order of perturbation theory, predicts 

$$\frac{1}{\tau(\tau \to \pi\nu_\tau)} = \frac{\alpha_\pi^2}{32\pi}\, m_\tau^3 \left[1 - (m_\pi/m_\tau)^2\right]^2. \tag{9.5}$$

Using the estimate of $\alpha_\pi$ from pion decay to calculate $1/\tau(\tau \to \pi\nu_\tau)$ provides a 
further test of lepton universality: the predicted value $2.42 \times 10^{-10}$ MeV compares 
quite well with the experimental value, $(2.6 \pm 0.1) \times 10^{-10}$ MeV. 
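
The prediction of (9.5) is easy to reproduce numerically. The sketch below uses the value $\alpha_\pi = 2.09 \times 10^{-9}$ MeV$^{-1}$ and $m_\tau = 1777$ MeV from the text, together with an assumed pion mass of 139.57 MeV; with these rounded inputs it gives about $2.4 \times 10^{-10}$ MeV, close to the quoted prediction.

```python
import math

# Inputs: alpha_pi and m_tau are the values quoted in the text;
# the pion mass is an assumed, rounded value. Units are MeV.
ALPHA_PI = 2.09e-9   # MeV^-1
M_TAU = 1777.0       # MeV
M_PION = 139.57      # MeV


def tau_to_pion_rate(alpha_pi=ALPHA_PI, m_tau=M_TAU, m_pion=M_PION):
    """Lowest-order decay rate 1/tau(tau -> pi nu) of equation (9.5), in MeV."""
    phase_space = (1.0 - (m_pion / m_tau) ** 2) ** 2
    return alpha_pi**2 / (32.0 * math.pi) * m_tau**3 * phase_space


if __name__ == "__main__":
    print(f"1/tau(tau -> pi nu) = {tau_to_pion_rate():.3e} MeV")
```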


9.3 Conservation of lepton number 

In the model Lagrangian discussed so far, a single lepton can change only to another 
of the same family, and a lepton and antilepton of the same family can only be 
created or destroyed together. There is thus a conservation law, the conservation 
of lepton number (antileptons being counted negatively), for each separate family, 
exemplified in the decays we have so far considered. 

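We saw in Section 7.1 that particle conservation follows from a $U(1)$ symmetry 
of the Lagrangian, and it is interesting to see how this is accomplished with our 
model Lagrangian. We have 

$$\mathcal{L} = \mathcal{L}_{\text{free}} + \mathcal{L}_{\text{int}},$$

where, using Dirac spinors for the lepton fields, 

$$\begin{aligned}
\mathcal{L}_{\text{free}} = {} & \partial_\mu \Phi_\pi^\dagger \partial^\mu \Phi_\pi - m_\pi^2 \Phi_\pi^\dagger \Phi_\pi \\
& + \bar\psi_e (i\gamma^\mu \partial_\mu - m_e)\psi_e + i\bar\nu_e \gamma^\mu \partial_\mu \nu_e \\
& + \bar\psi_\mu (i\gamma^\mu \partial_\mu - m_\mu)\psi_\mu + i\bar\nu_\mu \gamma^\mu \partial_\mu \nu_\mu \\
& + \bar\psi_\tau (i\gamma^\mu \partial_\mu - m_\tau)\psi_\tau + i\bar\nu_\tau \gamma^\mu \partial_\mu \nu_\tau,
\end{aligned}$$

$$\mathcal{L}_{\text{int}} = \alpha_\pi \left[ j^\mu \partial_\mu \Phi_\pi + j^{\mu\dagger} \partial_\mu \Phi_\pi^\dagger \right],$$

and, in terms of Dirac spinors, the current of equation (9.2) can be written 

$$j^\mu = \tfrac{1}{2}\bar\psi_e \gamma^\mu (1 - \gamma^5)\nu_e + \tfrac{1}{2}\bar\psi_\mu \gamma^\mu (1 - \gamma^5)\nu_\mu + \tfrac{1}{2}\bar\psi_\tau \gamma^\mu (1 - \gamma^5)\nu_\tau. \tag{9.6}$$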
By itself, $\mathcal{L}_{\text{free}}$ has seven $U(1)$ symmetries: seven independent phases on the 
seven free fields. Including $\mathcal{L}_{\text{int}}$ reduces these to four, which can be written 

$$\psi_e \to e^{i\beta} e^{i\alpha_e} \psi_e, \qquad \nu_e \to e^{i\alpha_e} \nu_e;$$
$$\psi_\mu \to e^{i\beta} e^{i\alpha_\mu} \psi_\mu, \qquad \nu_\mu \to e^{i\alpha_\mu} \nu_\mu;$$
$$\psi_\tau \to e^{i\beta} e^{i\alpha_\tau} \psi_\tau, \qquad \nu_\tau \to e^{i\alpha_\tau} \nu_\tau;$$
$$\Phi_\pi \to e^{i\beta} \Phi_\pi.$$

The phase factors $\alpha_e$, $\alpha_\mu$, $\alpha_\tau$ are associated with the conserved lepton currents 
(Problem 9.6). If we require $\mathcal{L}$ to be invariant under a local gauge symmetry, with 
$\beta = \beta(x)$ arbitrarily space and time dependent, we are led to the introduction of the 
electromagnetic field $A^\mu$, as in Section 5.5. We shall see that not all these features of 
our effective Lagrangian survive the introduction of neutrino mass into the Standard 
Model. 


9.4 Muon decay 

The analysis of the muon decays 

$$\mu^- \to e^- + \bar\nu_e + \nu_\mu, \qquad \mu^+ \to e^+ + \nu_e + \bar\nu_\mu, \tag{9.7}$$

has played a very important role in establishing the Standard Model. The decays 
involve lepton fields only, so that the physics is not obscured by the phenomenology 
of strong interaction fields as was our example of pion decay. 

An effective Lagrangian density that describes the decays again couples the 
participating particles into currents. In fact all decays seen so far that involve just 
leptons are well described by the effective interaction Lagrangian density 

$$\mathcal{L}_{\text{lepton}} = -2\sqrt{2}\, G_F\, j_\mu j^{\mu\dagger}, \tag{9.8}$$

with $j^\mu$ again defined by (9.2) or (9.6). A similar form for nuclear $\beta$ decay was 
introduced by Fermi, and $G_F$ is called the Fermi constant. The $2\sqrt{2}$ is a related 
accident of history. 

The term in (9.8) that describes $\mu^-$ decay is 

$$\mathcal{L} = -2\sqrt{2}\, G_F\, g_{\mu\nu} \left[ e_L^\dagger \tilde\sigma^\mu \nu_{eL} \right]\left[ \nu_{\mu L}^\dagger \tilde\sigma^\nu \mu_L \right]. \tag{9.9}$$

The most ready supply of muons comes from pion decays and these, as we have 
seen, are almost 100% polarised. The interaction Lagrangian density (9.9) implies 
a strong correlation between the angle $\theta$ made by the direction of the electron with 
the direction of the muon spin, and the energy $E_e$ of the electron. In the muon rest 
frame, to lowest order of perturbation theory, and neglecting terms in $(m_e/m_\mu)^2$, 
the decay rate into an angular interval $d\theta$ and energy interval $dE_e$ is (see Donoghue 
et al. 1992, p. 138) 

$$R(\theta, E_e)\, d\theta\, dE_e = \frac{m_\mu G_F^2}{6\pi^3} \left[ \frac{3}{4} m_\mu - E_e + \cos\theta \left( \frac{1}{4} m_\mu - E_e \right) \right] E_e^2\, dE_e \sin\theta\, d\theta. \tag{9.10}$$

Integrating (9.10) over $\theta$ and $E_e$ gives the total decay rate for this process 

$$\frac{1}{\tau(\mu \to e\nu_e\nu_\mu)} = \frac{m_\mu^5 G_F^2}{192\pi^3}. \tag{9.11}$$


The total muon decay rate, which includes also decays with photons in the final 
state, for example the decays 

$$\mu^- \to e^- + \gamma + \bar\nu_e + \nu_\mu,$$

has been very accurately measured, giving 

$$\tau_\mu = (2.19703 \pm 0.00004) \times 10^{-6}\ \text{s}.$$

A corresponding accurate theoretical expression that corrects (9.11) by including 
terms in $(m_e/m_\mu)^2$ and electromagnetic effects, gives 

$$G_F = 1.16639(2) \times 10^{-5}\ \text{GeV}^{-2}, \tag{9.12}$$

which is the presently accepted value of this important constant. 
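
A lowest-order estimate of $G_F$ can be sketched by inverting (9.11) with the measured lifetime. The code below assumes $\hbar = 6.582119569 \times 10^{-22}$ MeV s and a muon mass of 105.658 MeV, neither of which is quoted in the text; it omits the $(m_e/m_\mu)^2$ and electromagnetic corrections, so the result is expected to fall slightly below (9.12).

```python
import math

HBAR_MEV_S = 6.582119569e-22   # MeV s (assumed value of hbar)
TAU_MUON_S = 2.19703e-6        # s, the measured lifetime quoted in the text
M_MUON_MEV = 105.658           # MeV (assumed)


def fermi_constant_lowest_order(lifetime_s=TAU_MUON_S, m_muon=M_MUON_MEV):
    """Invert 1/tau = m^5 G_F^2 / (192 pi^3); returns G_F in GeV^-2."""
    width_mev = HBAR_MEV_S / lifetime_s              # decay rate in energy units
    g_squared = 192.0 * math.pi**3 * width_mev / m_muon**5
    g_mev = math.sqrt(g_squared)                     # MeV^-2
    return g_mev * 1.0e6                             # 1 MeV^-2 = 1e6 GeV^-2


if __name__ == "__main__":
    print(f"G_F (lowest order) = {fermi_constant_lowest_order():.4e} GeV^-2")
```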

Further tests of lepton universality are provided by the decays 

$$\tau^- \to \mu^- + \bar\nu_\mu + \nu_\tau, \qquad \tau^- \to e^- + \bar\nu_e + \nu_\tau,$$

and their charge conjugates. These, like muon decay, are described by appropriate 
terms in the interaction Lagrangian (9.8). Since both $(m_e/m_\tau)^2$ and $(m_\mu/m_\tau)^2$ are 
small, the first-order formula (9.11) with $m_\mu$ replaced by $m_\tau$ predicts these decay 
rates to be equal and $\approx 4 \times 10^{-10}$ MeV. They are indeed so within experimental 
error. Also from this formula 

$$\frac{\tau(\tau \to e\nu_e\nu_\tau)}{\tau(\mu \to e\nu_e\nu_\mu)} = \left( \frac{m_\mu}{m_\tau} \right)^5.$$

The ratio of the decay rates is $7.36 \times 10^{-7}$ and the ratio of the fifth power of the 
masses is $7.43 \times 10^{-7}$. 

It should be noted that the coupling constant $G_F$ has the dimension of (mass)$^{-2}$. 
The effective interaction (9.8) cannot be elevated into a quantum field interaction; 
see Section 8.4. 
9.5 The interactions of muon neutrinos with electrons 

In the 1960s, intense muon neutrino beams were engineered at Brookhaven and 
at CERN. Muon neutrinos (or antineutrinos) were produced as secondary particles 
from the decay of $\pi^+$ (or $\pi^-$) mesons in flight. It was from the observation that these 
neutrino beams produced almost exclusively muons rather than electrons, when in 
interaction with a target, that the distinction between electron neutrinos and muon 
neutrinos was established. 

The centre of mass energy $\sqrt{s}$ available in a collision of a neutrino with an 
electron at rest is relatively small, because of the smallness of the electron mass. If 
$E_\nu$ is the neutrino energy, 

$$s = m_e (2E_\nu + m_e), \tag{9.13}$$

(Problem 9.8). For example, if $E_\nu = 30$ GeV then $s = (175\ \text{MeV})^2$, which will 
produce no more than a muon. Most neutrino interactions will be with the atomic 
nuclei in the target. However, here we consider only the interactions with electrons. 
The interaction 

$$\nu_\mu + e^- \to \mu^- + \nu_e$$

is included in the effective interaction Lagrangian density (9.8). In first-order perturbation 
theory and averaging over electron polarisations, this Lagrangian predicts 
an isotropic differential cross-section in the centre of mass system: 

$$\frac{d\sigma}{d\Omega} = \frac{G_F^2}{4\pi^2} \frac{(s - m_\mu^2)^2}{s}, \qquad \sigma_{\text{tot}} = \frac{G_F^2}{\pi} \frac{(s - m_\mu^2)^2}{s}, \tag{9.14}$$

with $s$ the square of the centre of mass energy. (See Okun 1982, p. 134.) 
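
To get a feeling for the sizes involved, the sketch below evaluates (9.13) and the total cross-section of (9.14) for a 30 GeV neutrino. The conversion factor $(\hbar c)^2 \approx 3.8938 \times 10^{-22}$ MeV$^2$ cm$^2$ and the muon mass of 105.66 MeV are assumed inputs, and the function names are illustrative.

```python
import math

M_ELECTRON = 0.511          # MeV
M_MUON = 105.66             # MeV (assumed)
G_FERMI = 1.16639e-11       # MeV^-2, i.e. 1.16639e-5 GeV^-2 from (9.12)
HBARC_SQ = 3.8938e-22       # (hbar c)^2 in MeV^2 cm^2 (assumed conversion factor)


def s_electron_at_rest(e_nu_mev):
    """Equation (9.13): s for a neutrino of energy E_nu on an electron at rest."""
    return M_ELECTRON * (2.0 * e_nu_mev + M_ELECTRON)


def sigma_total_cm2(s):
    """Equation (9.14) for nu_mu + e -> mu + nu_e, converted to cm^2."""
    if s <= M_MUON**2:
        return 0.0  # below the threshold for producing a muon
    sigma_natural = G_FERMI**2 * (s - M_MUON**2) ** 2 / (math.pi * s)  # MeV^-2
    return sigma_natural * HBARC_SQ


if __name__ == "__main__":
    s = s_electron_at_rest(30_000.0)
    print(f"sqrt(s) = {math.sqrt(s):.1f} MeV")
    print(f"sigma_tot = {sigma_total_cm2(s):.2e} cm^2")
```

The cross-section comes out at a few times $10^{-40}$ cm$^2$, which illustrates why these experiments are difficult.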

At the low energies available experimentally, the cross-section appears to be 
consistent with the theoretical form. The high energy structure is not easily explored 
experimentally, because of (9.13), but clearly the theoretical formulae become 
inadequate at high energies: the expressions (9.14) increase without limit as s 
increases, and for a ‘point’ interaction this is inconsistent with unitarity. Nor is 
it possible to improve the expressions within this framework, since the effective 
Lagrangian does not give a renormalisable theory. 

The most significant result to come from the experiments on neutrino-electron 
interactions was the observation of elastic scattering for both $\nu_\mu$ and $\bar\nu_\mu$: 

$$\nu_\mu + e^- \to \nu_\mu + e^-,$$
$$\bar\nu_\mu + e^- \to \bar\nu_\mu + e^-,$$

with cross-sections of a magnitude similar to those for muon production. Such elastic 
scattering is not included in our $\mathcal{L}_{\text{int}}$ (though there are terms corresponding to 
$e\nu_e \to e\nu_e$ and $e\bar\nu_e \to e\bar\nu_e$). Thus another weak interaction must exist. The experimental 
investigation of this is difficult because of the smallness of the cross-sections 
at the available energies. We shall see from the Standard Model that the effective 
interaction Lagrangian required is again of current-current form, 

$$\mathcal{L}_{\text{int}} = -\frac{G_F}{\sqrt{2}}\, (j_{\text{neutral}})_\mu (j_{\text{neutral}})^\mu, \tag{9.15}$$

where, in terms of Dirac spinors, 

$$(j_{\text{neutral}})^\mu = \tfrac{1}{2}\bar\nu_e \gamma^\mu (1 - \gamma^5)\nu_e + \bar\psi_e \gamma^\mu (c_V - c_A \gamma^5)\psi_e \tag{9.16}$$
$$\quad + \text{similar terms for the } \mu \text{ and } \tau \text{ lepton families},$$

and $c_V$ and $c_A$ are parameters. The current is called a neutral current because it 
does not induce a change of charge as do the currents (9.2). (Note that it will also 
contribute to the scattering $e\nu_e \to e\nu_e$.) 

Rewriting (9.16) with two-component spinors, 

$$(j_{\text{neutral}})^\mu = \nu_{eL}^\dagger \tilde\sigma^\mu \nu_{eL} + (c_V + c_A)\, e_L^\dagger \tilde\sigma^\mu e_L + (c_V - c_A)\, e_R^\dagger \sigma^\mu e_R + \text{similar } \mu \text{ and } \tau \text{ terms}. \tag{9.17}$$

In this form it is evident that right-handed lepton fields as well as left-handed 
are involved in the neutral currents. The parameters $c_V$ and $c_A$ are related to the 
Weinberg angle $\theta_W$, which appears in the Standard Model, as we shall see in Chapter 
12 (equation (12.24)). The subscripts V and A refer, respectively, to the vector and 
axial vector nature of the terms in (9.16). (See Section 5.5.) 

One might anticipate that neutral currents are also present in atomic physics, 
and indeed they are. However, their effects are hard to discern experimentally. 
For example, they induce parity violation in atoms, but at atomic energies the 
weak interaction gives a very small effect. Indeed the decay of an unstable nuclear 
or atomic system through the neutral current must always compete with faster 
electromagnetic decays, and for this reason neutral current decays in these systems 
have never been observed. 


Problems 

9.1 In the decay of the $\pi^-$ at rest, $\pi^- \to e^- + \bar\nu_e$, show that 

$$\frac{1}{2}\left(1 - \frac{v_e}{c}\right) = \frac{m_e^2}{m_\pi^2 + m_e^2}.$$

9.2 Show that the density of final states for the decay of Problem 9.1 is 

$$\rho(E) = \frac{V}{(2\pi)^3}\, 4\pi p_e^2\, \frac{dp_e}{dE},$$

where $V$ is the normalisation volume and 

$$\frac{dp_e}{dE} = \frac{E_e}{m_\pi}.$$

9.3 Obtain the ratio of decay rates given by equation (9.4). 

9.4 The term in $\mathcal{L}_{\text{int}}$ describing the decay $\pi^- \to e^- + \bar\nu_e$ is 

$$\alpha_\pi\, e_L^\dagger \tilde\sigma^\mu \nu_{eL}\, \partial_\mu \Phi_\pi.$$

Assume that this gives a corresponding term $V(0)$ in the effective Hamiltonian, 

$$V(0) = -\alpha_\pi \int e_L^\dagger \tilde\sigma^\mu \nu_{eL}\, \partial_\mu \Phi_\pi\, d^3\mathbf{x}.$$

(This assumption will be justified in Chapter 12.) 

The transition probability per unit time for the decay is to lowest order 

$$2\pi\, |\langle e_{\mathbf{p}}, \bar\nu_{\mathbf{p}'} | V(0) | \pi^-(\text{rest}) \rangle|^2\, \rho(E)$$

where $\rho(E)$ is given by Problem 9.2. 

Use the free field expansions given in equations (3.35) and (6.24), and Problem 
6.5, to evaluate the matrix element above and hence verify equation (9.3). 

9.5 Verify the equivalence of the expressions (9.2) and (9.6) for the current $j^\mu$. 

9.6 Taking the effective Lagrangian of Section 9.3, show that the conserved current 
associated with the $U(1)$ symmetry $\psi_e \to e^{i\alpha_e}\psi_e$, $\nu_e \to e^{i\alpha_e}\nu_e$, is the electron, 
electron-neutrino current 

$$j^\mu = \bar\psi_e \gamma^\mu \psi_e + \bar\nu_e \gamma^\mu \nu_e.$$

Show that the conserved current associated with $e^{i\beta}$ in the transformations of 
Section 9.3 is 

$$\bar\psi_e \gamma^\mu \psi_e + \cdots + i\left[ \left( \Phi_\pi^\dagger \partial^\mu \Phi_\pi - (\partial^\mu \Phi_\pi^\dagger)\Phi_\pi \right) + \alpha_\pi \left( j^{\mu\dagger}\Phi_\pi^\dagger - j^\mu \Phi_\pi \right) \right].$$

Construct the Lagrangian density that results, when the electromagnetic field is 
introduced by elevating the global $U(1)$ symmetry of the phase factor $e^{i\beta}$ into a local 
gauge symmetry. 

9.7 Estimate $G_F$ from the expression (9.11) and the experimental lifetime $\tau_\mu$. 

9.8 Using a suitable Lorentz invariant, obtain equation (9.13). 
9.9 Pick out the term in the effective Lagrangian density (9.8) that contributes to the 
scattering 

$$e^- + \nu_e \to e^- + \nu_e,$$

and the term in (9.15) that contributes to the scattering 

$$e^- + \nu_\mu \to e^- + \nu_\mu.$$

9.10 The $K^-$ is like the $\pi^-$, but with an s quark replacing the d. An effective interaction 
with leptons is similar in form to equation (9.1), with $\Phi_K$ replacing $\Phi_\pi$ 
and $\alpha_K$ replacing $\alpha_\pi$. Use the analogue of equation (9.4) to estimate the ratio 
$\tau(K \to \mu\nu_\mu)/\tau(K \to e\nu_e)$, and compare with the observed value $(2.44 \pm 0.1) \times 
10^{-5}$ ($m_K = 493.68$ MeV). 

The mean life $\tau(K^- \to \mu^-\bar\nu_\mu)$ is measured to be $1.948 \times 10^{-8}$ s. Estimate $\alpha_K/\alpha_\pi$. 

9.11 Obtain the decay rate (9.5). 
Symmetry breaking in model theories 


In Chapter 9, ‘effective’ weak interaction Lagrangian densities were constructed. 
When used in low orders of perturbation theory, these account well for the observed 
phenomena at low energies. Difficulties arise in higher order perturbation theory, as 
they do in quantum electrodynamics. There is, however, an important difference: it 
has been proved that these effective Lagrangian theories cannot be renormalised and 
they are therefore unsatisfactory. Furthermore, at higher energies new phenomena 
appear, and it is now well established experimentally that the weak interaction is 
mediated by the $W^+$, $W^-$ and $Z$ bosons. How are these particles to be incorporated in 
a theory of the weak interaction that can be renormalised, and which has the same 
seeming inevitability as QED? The answer lies in the Weinberg-Salam unified 
theory of the electromagnetic and weak interactions. As an introduction to the 
Weinberg-Salam theory we shall in this chapter consider ‘model’ theories, the 
mathematics of which is fairly simple, but which contain the basic ideas we shall 
need. 


10.1 Global symmetry breaking and Goldstone bosons 

A possible Lagrangian density for a complex scalar field $\Phi = (\phi_1 + i\phi_2)/\sqrt{2}$ is 

$$\mathcal{L} = \partial_\mu \Phi^\dagger \partial^\mu \Phi - m^2 \Phi^\dagger \Phi \tag{10.1}$$

(cf. equation (3.32)). 

In this expression $(\partial\Phi^\dagger/\partial t)(\partial\Phi/\partial t)$ can be regarded as the kinetic energy density 
and $\nabla\Phi^\dagger \cdot \nabla\Phi + m^2 \Phi^\dagger \Phi$ as the potential energy density (see Section 3.3). If $\Phi$ is 
constant, independent of space and time, the only contribution to the energy is 
$m^2 \Phi^\dagger \Phi$. Since $m^2$ is positive this will be a minimum when $\phi_1 = \phi_2 = 0$. Thus 
$\Phi = 0$ corresponds to the ‘vacuum’ state. Consider now the Lagrangian density 
obtained by changing the sign in front of $m^2$. This would be unstable: the potential 
energy density is then unbounded below. Stability can be restored by introducing 
a term $(m^2/2\phi_0^2)(\Phi^\dagger \Phi)^2$ where $\phi_0$ is another (real) parameter. For convenience we 
add a constant term $m^2 \phi_0^2/2$, and then 

$$\mathcal{L} = \partial_\mu \Phi^\dagger \partial^\mu \Phi - V(\Phi^\dagger \Phi),$$

where 

$$V(\Phi^\dagger \Phi) = \frac{m^2}{2\phi_0^2} \left[ \Phi^\dagger \Phi - \phi_0^2 \right]^2. \tag{10.2}$$

Figure 10.1 Plot of $V = (m^2/2\phi_0^2)[\Phi^\dagger \Phi - \phi_0^2]^2$ as a function of $|\Phi|$; $\Phi$ is here a 
classical field. 

The form of $V$ is shown in Fig. 10.1. The minimum field energy is now obtained 
with $\Phi$ constant independent of space and time, but such that $\Phi^\dagger \Phi = |\Phi|^2 = \phi_0^2$. 
Such a field is not unique but is defined by a point on the circle $|\Phi| = \phi_0$ in the 
state space $(\phi_1, \phi_2)$, so that the number of possible vacuum states is infinite. 

An analogy with magnetism is helpful. The Hamiltonian describing a 
Heisenberg ferromagnet has rotational symmetry: all directions in space are equivalent. 
However, in its ground state a ferromagnet is magnetised in some particular 
direction, which is not determined within the theory, and the rotational symmetry 
is lost. This is an example of spontaneous symmetry breaking. 
The Lagrangian density (10.2) has a ‘global’ $U(1)$ symmetry: $\Phi \to \Phi' = 
e^{-i\alpha}\Phi$, $\mathcal{L} \to \mathcal{L}' = \mathcal{L}$, for any real $\alpha$. Equivalently, 

$$\phi_1' = \phi_1 \cos\alpha + \phi_2 \sin\alpha,$$
$$\phi_2' = -\phi_1 \sin\alpha + \phi_2 \cos\alpha.$$

The transformation rotates the state round a circle $|\Phi|^2 = $ constant in the state space 
$(\phi_1, \phi_2)$. If we pick out the particular direction in $(\phi_1, \phi_2)$ space for which $\Phi$ is real, 
and take the vacuum state to be $(\phi_0, 0)$, we break the $U(1)$ symmetry. 

Expanding about this ground state $(\phi_0, 0)$, we put $\Phi = \phi_0 + (1/\sqrt{2})(\chi + i\psi)$. 
The Lagrangian density becomes 

$$\mathcal{L} = \tfrac{1}{2}\partial_\mu \chi\, \partial^\mu \chi + \tfrac{1}{2}\partial_\mu \psi\, \partial^\mu \psi - \frac{m^2}{2\phi_0^2} \left[ \sqrt{2}\,\phi_0 \chi + \frac{\chi^2}{2} + \frac{\psi^2}{2} \right]^2. \tag{10.3}$$

After breaking the $U(1)$ symmetry we must interpret the new fields. (In much the 
same way, the excited states of a ferromagnet cannot be discussed until the spatial 
symmetry has been broken.) In place of the complex field $\Phi$ we have two coupled 
scalar real fields $\chi$ and $\psi$. We write 

$$\mathcal{L} = \mathcal{L}_{\text{free}} + \mathcal{L}_{\text{int}},$$

where 

$$\mathcal{L}_{\text{free}} = \tfrac{1}{2}\partial_\mu \chi\, \partial^\mu \chi - m^2 \chi^2 + \tfrac{1}{2}\partial_\mu \psi\, \partial^\mu \psi. \tag{10.4}$$

$\mathcal{L}_{\text{free}}$ represents free particle fields, and contains all the terms in $\mathcal{L}$ that are quadratic 
in the fields. For classical fields and small oscillations, these terms dominate. The 
rest of the Lagrangian density, $\mathcal{L}_{\text{int}}$, corresponds to interactions between the free 
particles and higher order corrections to their motion. 

There is a quadratic term $-m^2\chi^2$ in (10.4), so that the $\chi$ field corresponds to 
a scalar spin-zero particle of mass $\sqrt{2}\,m$ (by comparison with (3.18)). In the case 
of the $\psi$ field there is no such quadratic term: the corresponding scalar spin-zero 
particle is therefore massless. The massless particles that always arise as a result of 
global symmetry breaking are called Goldstone bosons. 
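
The mass assignments can be checked numerically from the curvature of the potential at the chosen vacuum. The sketch below is an illustration rather than part of the derivation: it writes $V$ of (10.2) in terms of the shifted fields $\chi$ and $\psi$, and estimates the second derivatives at $\chi = \psi = 0$ by central differences. For a real field normalised as in (10.4), the mass squared is $\partial^2 V/\partial(\text{field})^2$ at the minimum. The parameter values and function names are arbitrary choices.

```python
import math


def potential(chi, psi, m=1.0, phi0=1.0):
    """V of (10.2) with Phi = phi0 + (chi + i psi)/sqrt(2)."""
    phi_dagger_phi = (phi0 + chi / math.sqrt(2.0)) ** 2 + psi**2 / 2.0
    return m**2 / (2.0 * phi0**2) * (phi_dagger_phi - phi0**2) ** 2


def second_derivative(f, step=1.0e-3):
    """Central-difference estimate of f''(0)."""
    return (f(step) + f(-step) - 2.0 * f(0.0)) / step**2


def masses_squared(m=1.0, phi0=1.0):
    """Return (M_chi^2, M_psi^2) from the curvature of V at the vacuum (phi0, 0)."""
    m2_chi = second_derivative(lambda x: potential(x, 0.0, m, phi0))
    m2_psi = second_derivative(lambda x: potential(0.0, x, m, phi0))
    return m2_chi, m2_psi


if __name__ == "__main__":
    m2_chi, m2_psi = masses_squared(m=1.0, phi0=1.0)
    print(f"M_chi^2 = {m2_chi:.6f} (expected 2 m^2), M_psi^2 = {m2_psi:.6f}")
```

The curvature along $\chi$ is $2m^2$, giving the mass $\sqrt{2}\,m$, while the curvature along $\psi$ vanishes up to the finite-difference error: this flat direction around the circle of minima is the Goldstone mode.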


10.2 Local symmetry breaking and the Higgs boson 

We now generalise further, and construct a Lagrangian density that is invariant 
under a local $U(1)$ gauge transformation, 

$$\Phi \to \Phi' = e^{-iq\theta}\Phi,$$

where $\theta = \theta(x)$ may be space and time dependent. This requires the introduction 
of a (massless) gauge field $A_\mu$, as in Section 7.5, and we take 

$$\mathcal{L} = [(\partial_\mu - iqA_\mu)\Phi^\dagger][(\partial^\mu + iqA^\mu)\Phi] - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} - V(\Phi^\dagger\Phi), \tag{10.5}$$

where $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, and again 

$$V(\Phi^\dagger\Phi) = \frac{m^2}{2\phi_0^2}\left[\Phi^\dagger\Phi - \phi_0^2\right]^2.$$

$\mathcal{L}$ is invariant under the local gauge transformation 

$$\Phi(x) \to \Phi'(x) = e^{-iq\theta}\Phi(x), \qquad A_\mu(x) \to A'_\mu(x) = A_\mu(x) + \partial_\mu\theta(x).$$

A minimum field energy is obtained when the fields $A_\mu$ vanish, and $\Phi$ is constant, 
defined by a point on the circle $|\Phi| = \phi_0$. Any gauge transformation on this field 
configuration is also a minimum. Again we have an infinity of vacuum states. 

Given $\Phi(x)$, we can always choose $\theta(x)$ so that the field $\Phi'(x) = e^{-iq\theta}\Phi(x)$ is 
real. This breaks the symmetry, since we are no longer free to make further gauge 
transformations. 

Putting $\Phi'(x) = \phi_0 + h(x)/\sqrt{2}$, where $h(x)$ is real, gives 

$$\mathcal{L} = [(\partial_\mu - iqA'_\mu)(\phi_0 + h/\sqrt{2})][(\partial^\mu + iqA'^\mu)(\phi_0 + h/\sqrt{2})] - \tfrac{1}{4}F'_{\mu\nu}F'^{\mu\nu} - \frac{m^2}{2\phi_0^2}\left[\sqrt{2}\,\phi_0 h + \tfrac{1}{2}h^2\right]^2. \tag{10.6}$$

For clarity, we again separate this into 

$$\mathcal{L} = \mathcal{L}_{\text{free}} + \mathcal{L}_{\text{int}}$$

where, dropping the primes on the gauge field, 

$$\begin{aligned}
\mathcal{L}_{\text{free}} &= \tfrac{1}{2}\partial_\mu h\, \partial^\mu h - m^2 h^2 - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} + q^2\phi_0^2 A_\mu A^\mu, \\
\mathcal{L}_{\text{int}} &= q^2 A_\mu A^\mu \left(\sqrt{2}\,\phi_0 h + \tfrac{1}{2}h^2\right) - \frac{m^2 h^2}{2\phi_0^2}\left(\sqrt{2}\,\phi_0 h + \tfrac{1}{4}h^2\right).
\end{aligned} \tag{10.7}$$

Before symmetry breaking, we had a complex scalar field $\Phi = (\phi_1 + i\phi_2)/\sqrt{2}$, 
and a massless vector field with two polarisation states (Section 4.4). In $\mathcal{L}_{\text{free}}$ we 
have a single scalar field $h(x)$ corresponding to a spinless boson of mass $\sqrt{2}\,m$, 
and a vector field $A_\mu$, corresponding to a vector boson of mass $\sqrt{2}\,q\phi_0$, with three 
independent components (Section 4.9). 

This mechanism for introducing mass into a theory was invented by Higgs (1964) 
and others (for example Anderson, 1963), and the particle corresponding to the field 
$h(x)$ is called a Higgs boson. As a consequence of local symmetry breaking the gauge 
field acquires a mass, and the massless spin-zero Goldstone boson that appeared 
in our example of global symmetry breaking in Section 10.1 is replaced by the 
longitudinal polarised state of this massive spin one boson. 

In the Weinberg and Salam ‘electroweak’ theory, the masses of the $W^\pm$ and 
$Z$ particles arise as a result of symmetry breaking. The resulting theory can be 
renormalised, whereas the phenomenological theory of Chapter 9 cannot be renormalised. 
The form of $V(\Phi^\dagger\Phi)$ that has been introduced in this chapter appears also 
in the electroweak theory. It may seem a somewhat arbitrary feature. However, it 
can be shown to be the most general form that can be renormalised. 


Problems 


10.1 What interaction term in the model Lagrangian density (10.3) allows the massive 
boson to decay into two Goldstone bosons? Show that the decay rate in lowest order 
perturbation theory is 



1 


r(x t|nW 1287T V (p 0 J 

10.2 Show that with the model Lagrangian density (10.7), the vector boson would be 
stable, but if the coupling constant $q < m/(2\phi_0)$ the scalar boson would decay into 
two vector bosons. 
Massive gauge fields 


In the preceding chapter (Section 10.2), we set up a simple Lorentz invariant 
Lagrangian density, which we required to be also invariant under a local $U(1)$ 
transformation. This requirement leads to the introduction of a ‘gauge field’ $A_\mu$. 
The system has a degenerate ground state. Breaking the local symmetry results in 
the appearance of a vector field carrying mass, together with a scalar Higgs field 
also carrying mass. The motivation for introducing mass in this way is that the 
subsequent quantum theory can be renormalised. In this chapter we apply the same 
idea to a more complicated Lagrangian, which will turn out to have remarkable 
physical significance. 


11.1 SU(2) symmetry 


As a further generalisation, which is basic to the Standard Model, we shall construct 
a Lagrangian density that is invariant under a local $SU(2)$ transformation as well as 
a local $U(1)$ transformation. The idea was first explored by Yang and Mills (1954). 
We introduce a two-component field 

$$\Phi = \begin{pmatrix} \Phi_A \\ \Phi_B \end{pmatrix}, \tag{11.1}$$

where now $\Phi_A$ and $\Phi_B$ are both complex scalar fields, 

$$\Phi_A = \phi_1 + i\phi_2, \qquad \Phi_B = \phi_3 + i\phi_4,$$

giving, in total, four real fields. 

If $e^{-i\theta}$ is any element of the group $U(1)$ and $U$ is any element of the group $SU(2)$ 
(discussed in Appendix B), so that $U^\dagger U = UU^\dagger = 1$, we require the Lagrangian 
density to be invariant under the $U(1) \times SU(2)$ transformation 

$$\Phi \to \Phi' = e^{-i\theta} U \Phi. \tag{11.2}$$
A simple Lagrangian density that has a global $U(1) \times SU(2)$ symmetry is 

$$\mathcal{L}_\Phi = \partial_\mu \Phi^\dagger \partial^\mu \Phi - V(\Phi^\dagger \Phi). \tag{11.3}$$

In terms of the real fields, 

$$\Phi^\dagger \Phi = \Phi_A^* \Phi_A + \Phi_B^* \Phi_B = \phi_1^2 + \phi_2^2 + \phi_3^2 + \phi_4^2,$$

$$\partial_\mu \Phi^\dagger \partial^\mu \Phi = \partial_\mu\phi_1 \partial^\mu\phi_1 + \partial_\mu\phi_2 \partial^\mu\phi_2 + \partial_\mu\phi_3 \partial^\mu\phi_3 + \partial_\mu\phi_4 \partial^\mu\phi_4.$$

If $V(\Phi^\dagger \Phi) = m^2 \Phi^\dagger \Phi$, this Lagrangian density corresponds to four independent 
free scalar fields, all with the same mass $m$ (cf. (3.18)). 

In the Standard Model, the $U(1)$ and $SU(2)$ global symmetries are promoted to 
local symmetries. The $U(1)$ transformation may be written 

$$\Phi \to \Phi' = e^{-i\theta}\Phi = \exp(-i\theta\tau^0)\Phi, \tag{11.4a}$$

where in this context we write $\tau^0$ for the unit matrix 

$$\tau^0 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}.$$

For this to become a local symmetry, we must introduce a vector gauge field $B_\mu(x)\tau^0$ 
with the transformation law 

$$B_\mu(x) \to B'_\mu(x) = B_\mu(x) + (2/g_1)\partial_\mu\theta, \tag{11.4b}$$

and make the replacement 

$$i\partial_\mu \to i\partial_\mu - (g_1/2)B_\mu,$$

as in Chapter 7. Here the constant $g_1$ is a dimensionless parameter of the theory, 
and the factor 2 follows convention. 

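Any element of $SU(2)$ can be written in the form 

$$U = \exp(-i\alpha^k\tau^k) \tag{11.5}$$

where the $\alpha^k$ are three real numbers and the $\tau^k$ are the three generators of the group 
$SU(2)$. The $\tau^k$ are identical to the Pauli spin matrices: 

$$\tau^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \tau^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \tau^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$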

For the global $SU(2)$ symmetry to be made into a local $SU(2)$ symmetry, with $U = 
U(x)$ dependent on space and time coordinates, we must introduce a vector gauge 
field $W^k_\mu(x)$ for each generator $\tau^k$. The transformation law for the matrices 

$$W_\mu(x) = W^k_\mu(x)\,\tau^k$$

is 

$$W_\mu(x) \to W'_\mu(x) = U(x)W_\mu(x)U^\dagger(x) + (2i/g_2)(\partial_\mu U(x))U^\dagger(x), \tag{11.6}$$

which is a generalisation of (11.4). Here $g_2$ is another dimensionless parameter of 
the theory. 

Note that the matrices 

$$W_\mu(x) = \begin{pmatrix} W^3_\mu & W^1_\mu - iW^2_\mu \\ W^1_\mu + iW^2_\mu & -W^3_\mu \end{pmatrix} \tag{11.7}$$

are Hermitian and have zero trace. These properties are preserved by the 
transformation (11.6) as is clearly necessary (Problem 11.1). A global $SU(2)$ transformation 
$W'_\mu = UW_\mu U^\dagger$ corresponds to a rotation of the vectors $W^k_\mu$ in the three-dimensional 
‘weak isospin’ space defined by the generators $\tau^k$. (See Appendix B.) 
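
The two statements above (a global SU(2) transformation keeps the matrix (11.7) Hermitian and traceless, and acts as a rotation of the three components W^k) can be checked numerically. The following is only an illustrative sketch in Python: the helper names, the sample components and the rotation parameters are arbitrary choices made here, and U is built from (11.5) using the closed form cos(alpha) 1 - i sin(alpha) n.tau for a unit vector n.

```python
import math

def matmul(a, b):
    """Multiply two 2x2 complex matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def w_matrix(w1, w2, w3):
    """W = W^k tau^k written out as in (11.7), for real w1, w2, w3."""
    return [[complex(w3), complex(w1, -w2)],
            [complex(w1, w2), complex(-w3)]]

def su2_element(alpha, n):
    """U = exp(-i alpha n.tau) = cos(alpha) 1 - i sin(alpha) n.tau (n a unit vector)."""
    c, s = math.cos(alpha), math.sin(alpha)
    n_tau = w_matrix(*n)  # n.tau has the same matrix form as (11.7)
    return [[(c if i == j else 0) - 1j * s * n_tau[i][j] for j in range(2)]
            for i in range(2)]

def components(w):
    """Read (W^1, W^2, W^3) back off a Hermitian traceless matrix."""
    return (w[1][0].real, w[1][0].imag, w[0][0].real)

# Arbitrary illustrative values (not taken from the text).
W = w_matrix(0.3, -1.2, 0.7)
U = su2_element(0.9, (2 / 3, -1 / 3, 2 / 3))

# Global transformation W' = U W U^dagger.
W_prime = matmul(matmul(U, W), dagger(U))

trace = W_prime[0][0] + W_prime[1][1]
hermitian_defect = max(abs(W_prime[i][j] - W_prime[j][i].conjugate())
                       for i in range(2) for j in range(2))
norm_before = math.sqrt(sum(x * x for x in components(W)))
norm_after = math.sqrt(sum(x * x for x in components(W_prime)))
```

A vanishing `trace` and `hermitian_defect`, together with `norm_before` equal to `norm_after`, are what one expects of a rotation in weak isospin space. The derivative term in (11.6), which matters for local transformations, is not exercised by this sketch.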

Finally we define 

$$D_\mu\Phi = [\partial_\mu + (ig_1/2)B_\mu + (ig_2/2)W_\mu]\Phi. \tag{11.8a}$$

It is straightforward to show 

$$D'_\mu\Phi' = [\partial_\mu + (ig_1/2)B'_\mu + (ig_2/2)W'_\mu]\Phi' = e^{-i\theta}\,U D_\mu\Phi,$$

where 

$$\Phi' = e^{-i\theta}\,U\Phi. \tag{11.8b}$$

Hence the locally gauge invariant Lagrangian density corresponding to (11.3) is 

$$\mathcal{L}_\Phi = (D_\mu\Phi)^\dagger D^\mu\Phi - V(\Phi^\dagger\Phi). \tag{11.9}$$

$\mathcal{L}_\Phi$ is also invariant under Lorentz transformations if we require $B_\mu$ and $W^k_\mu$ to 
transform as covariant four-vectors. 


11.2 The gauge fields 

In the case of the gauge field $B_\mu$, we define the field strength tensor $B_{\mu\nu}$ by 

$$B_{\mu\nu} = \partial_\mu B_\nu - \partial_\nu B_\mu, \tag{11.10}$$

and take the dynamical contribution to the Lagrangian density to be $-(1/4)B_{\mu\nu}B^{\mu\nu}$, 
as in Section 4.2. 


There are additional complications in introducing the field strength tensors for 
the gauge fields $W_\mu$, stemming from the non-Abelian nature of the group $SU(2)$. 
The field strength tensor must be taken to be 

$$W_{\mu\nu} = [\partial_\mu + (ig_2/2)W_\mu]W_\nu - [\partial_\nu + (ig_2/2)W_\nu]W_\mu. \tag{11.11}$$

Under an $SU(2)$ transformation, $W_\mu \to W'_\mu$ given by (11.6), it is straightforward, 
if tedious, to show that 

$$W_{\mu\nu} \to W'_{\mu\nu} = UW_{\mu\nu}U^\dagger. \tag{11.12}$$

In verifying this result, note that, since $UU^\dagger = 1$, 

$$U(\partial_\mu U^\dagger) + (\partial_\mu U)U^\dagger = 0.$$

The complicated definition of $W_{\mu\nu}$ given by (11.11) is necessary in order to achieve 
the simple transformation property (11.12). 

We then take the total dynamical contribution to the Lagrangian density 
associated with the gauge fields to be 

$$\mathcal{L}_{\rm dyn} = -\tfrac{1}{4}B_{\mu\nu}B^{\mu\nu} - \tfrac{1}{8}\,\mathrm{Tr}(W_{\mu\nu}W^{\mu\nu}). \tag{11.13}$$

Using (11.12) and the cyclic invariance of the trace, we can see that $\mathcal{L}_{\rm dyn}$ is invariant 
under a local $SU(2)$ transformation. 

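Using the results $[\tau^2, \tau^3] = 2i\tau^1$, etc., the matrix $W_{\mu\nu}$ may be written 

$$W_{\mu\nu} = W^i_{\mu\nu}\,\tau^i \tag{11.14}$$

where 

$$W^1_{\mu\nu} = \partial_\mu W^1_\nu - \partial_\nu W^1_\mu - g_2(W^2_\mu W^3_\nu - W^2_\nu W^3_\mu), \tag{11.15a}$$

$$W^2_{\mu\nu} = \partial_\mu W^2_\nu - \partial_\nu W^2_\mu - g_2(W^3_\mu W^1_\nu - W^3_\nu W^1_\mu), \tag{11.15b}$$

$$W^3_{\mu\nu} = \partial_\mu W^3_\nu - \partial_\nu W^3_\mu - g_2(W^1_\mu W^2_\nu - W^1_\nu W^2_\mu). \tag{11.15c}$$

Since $\mathrm{Tr}(\tau^i)^2 = 2$, and $\mathrm{Tr}(\tau^i\tau^j) = 0$, $i \neq j$, we can use (11.14) to express the 
Lagrangian density in the more reassuring form: 

$$\mathcal{L}_{\rm dyn} = -\tfrac{1}{4}B_{\mu\nu}B^{\mu\nu} - \tfrac{1}{4}\sum_{i=1}^{3} W^i_{\mu\nu}W^{i\mu\nu}. \tag{11.16}$$

We shall see, later in this chapter, that the fields $W^1_\mu$ and $W^2_\mu$ are electrically 
charged, and it is convenient to define here the complex combinations 

$$W^+_\mu = (W^1_\mu - iW^2_\mu)/\sqrt{2}, \qquad W^-_\mu = (W^1_\mu + iW^2_\mu)/\sqrt{2}. \tag{11.17}$$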
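
The Pauli-matrix identities quoted above, the commutators $[\tau^i,\tau^j] = 2i\tau^k$ for cyclic $(i,j,k)$ and the traces $\mathrm{Tr}(\tau^i\tau^j) = 2\delta_{ij}$, are easy to confirm by direct multiplication. The sketch below is only an illustration; the helper names are chosen here and are not part of the text.

```python
# Pauli matrices tau^1, tau^2, tau^3 as nested lists.
TAU = {
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}

def matmul(a, b):
    """Multiply two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def commutator(a, b):
    """[a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

def close(a, b, tol=1e-12):
    """Entry-wise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))

# [tau^i, tau^j] = 2i tau^k for the three cyclic orderings of (1, 2, 3).
commutators_ok = all(
    close(commutator(TAU[i], TAU[j]), [[2j * x for x in row] for row in TAU[k]])
    for i, j, k in [(1, 2, 3), (2, 3, 1), (3, 1, 2)]
)

# Tr(tau^i tau^j) = 2 for i = j and 0 otherwise.
trace_table = {(i, j): trace(matmul(TAU[i], TAU[j])) for i in TAU for j in TAU}
traces_ok = all(abs(t - (2 if i == j else 0)) < 1e-12
                for (i, j), t in trace_table.items())
```

The trace identity is what turns the factor 1/8 multiplying the trace in (11.13) into a factor 1/4 multiplying a sum over the three component fields.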
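Note that the field $W^+_\mu$ is the complex conjugate of the field $W^-_\mu$. We also define 

$$W^+_{\mu\nu} = (W^1_{\mu\nu} - iW^2_{\mu\nu})/\sqrt{2} = (\partial_\mu + ig_2W^3_\mu)W^+_\nu - (\partial_\nu + ig_2W^3_\nu)W^+_\mu \tag{11.18}$$

using (11.15a) and (11.15b). $W^-_{\mu\nu}$ is defined similarly. 

We can also write (11.15c) as 

$$W^3_{\mu\nu} = \partial_\mu W^3_\nu - \partial_\nu W^3_\mu - ig_2(W^-_\mu W^+_\nu - W^-_\nu W^+_\mu), \tag{11.19}$$

and (11.16) becomes 

$$\mathcal{L}_{\rm dyn} = -\tfrac{1}{4}B_{\mu\nu}B^{\mu\nu} - \tfrac{1}{4}W^3_{\mu\nu}W^{3\mu\nu} - \tfrac{1}{2}W^-_{\mu\nu}W^{+\mu\nu}. \tag{11.20}$$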


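11.3 Breaking the SU(2) symmetry 

As in equation (10.2) we take $V(\Phi^\dagger\Phi)$ to be 

$$V(\Phi^\dagger\Phi) = \frac{m^2}{2\phi_0^2}\left[(\Phi^\dagger\Phi) - \phi_0^2\right]^2, \tag{11.21}$$

where $\phi_0$ is a fixed parameter that is the analogue of (10.2). With this expression 
for $V$, the vacuum state of our system is degenerate in the four-dimensional space 
of the scalar fields. We now break the $SU(2)$ symmetry. At our disposal we have the 
three real parameters $\alpha^k(x)$ that specify an element of $SU(2)$. We use this freedom 
to adopt a gauge in which for any field configuration $\Phi_A = 0$ (two conditions) and 
$\Phi_B$ is real (one condition). The ground state is then 

$$\Phi_{\rm ground} = \begin{pmatrix} 0 \\ \phi_0 \end{pmatrix}, \tag{11.22}$$

and excited states are of the form 

$$\Phi = \begin{pmatrix} 0 \\ \phi_0 + h(x)/\sqrt{2} \end{pmatrix}, \tag{11.23}$$

where the field $h(x)$ is real. 

A local $U(1)$ symmetry remains: the fields (11.23) are unchanged by a $U(1) \times 
SU(2)$ transformation of the form 

$$e^{-i\theta}\,U = e^{-i\theta}\begin{pmatrix} e^{-i\theta} & 0 \\ 0 & e^{i\theta} \end{pmatrix} = \begin{pmatrix} e^{-2i\theta} & 0 \\ 0 & 1 \end{pmatrix}. \tag{11.24}$$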
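Such matrices give a $2 \times 2$ matrix representation of the group $U(1)$. This residual 
symmetry will turn out to be the $U(1)$ symmetry of electromagnetism. 

We wish to express $\mathcal{L}_\Phi$ (equation (11.9)) in terms of the field $h(x)$. We have from 
(11.21) 

$$V(\Phi^\dagger\Phi) = m^2h^2 + \frac{m^2h^3}{\sqrt{2}\,\phi_0} + \frac{m^2h^4}{8\phi_0^2} = V(h),$$

and from (11.8a) and (11.7) 

$$D_\mu\Phi = \begin{pmatrix} 0 \\ \partial_\mu h/\sqrt{2} \end{pmatrix} + \frac{ig_1}{2}\begin{pmatrix} 0 \\ B_\mu(\phi_0 + h/\sqrt{2}) \end{pmatrix} + \frac{ig_2}{2}\begin{pmatrix} \sqrt{2}\,W^+_\mu(\phi_0 + h/\sqrt{2}) \\ -W^3_\mu(\phi_0 + h/\sqrt{2}) \end{pmatrix}.$$

Multiplying $D_\mu\Phi$ by $(D^\mu\Phi)^\dagger$ we find 

$$\begin{aligned}
\mathcal{L}_\Phi &= \tfrac{1}{2}\partial_\mu h\,\partial^\mu h + \frac{g_2^2}{2}W^-_\mu W^{+\mu}(\phi_0 + h/\sqrt{2})^2 \\
&\quad + \left[\frac{g_2^2}{4}W^3_\mu W^{3\mu} - \frac{g_1g_2}{2}W^3_\mu B^\mu + \frac{g_1^2}{4}B_\mu B^\mu\right](\phi_0 + h/\sqrt{2})^2 - V(h) \\
&= \tfrac{1}{2}\partial_\mu h\,\partial^\mu h + \frac{g_2^2}{2}W^-_\mu W^{+\mu}(\phi_0 + h/\sqrt{2})^2 \\
&\quad + \tfrac{1}{4}(g_1^2 + g_2^2)Z_\mu Z^\mu(\phi_0 + h/\sqrt{2})^2 - V(h).
\end{aligned} \tag{11.25}$$

We have written 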


$$Z_\mu = W^3_\mu\cos\theta_w - B_\mu\sin\theta_w, \tag{11.26}$$

where 

$$\cos\theta_w = \frac{g_2}{(g_1^2 + g_2^2)^{1/2}}, \qquad \sin\theta_w = \frac{g_1}{(g_1^2 + g_2^2)^{1/2}}. \tag{11.27}$$

$\theta_w$ is called the Weinberg angle. 

Along with the field $Z_\mu$, we define the orthogonal combination 

$$A_\mu = W^3_\mu\sin\theta_w + B_\mu\cos\theta_w. \tag{11.28}$$

Equations (11.26) and (11.28) correspond to a rotation of axes in $(B_\mu, W^3_\mu)$ space. 
The rotation can be inverted to give 

$$B_\mu = A_\mu\cos\theta_w - Z_\mu\sin\theta_w, \qquad W^3_\mu = A_\mu\sin\theta_w + Z_\mu\cos\theta_w. \tag{11.29}$$

Substituting in (11.10) and (11.19) gives 

$$B_{\mu\nu} = A_{\mu\nu}\cos\theta_w - Z_{\mu\nu}\sin\theta_w,$$

$$W^3_{\mu\nu} = A_{\mu\nu}\sin\theta_w + Z_{\mu\nu}\cos\theta_w - ig_2(W^-_\mu W^+_\nu - W^-_\nu W^+_\mu),$$

where 

$$A_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu \quad (A_{\mu\nu}\ \text{is the}\ F_{\mu\nu}\ \text{of Chapter 4})$$

and 

$$Z_{\mu\nu} = \partial_\mu Z_\nu - \partial_\nu Z_\mu. \tag{11.30}$$
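
The change of basis from $(B_\mu, W^3_\mu)$ to $(A_\mu, Z_\mu)$ is an ordinary two-dimensional rotation through the Weinberg angle, and the combination $g_2W^3_\mu - g_1B_\mu$ that appears on squaring $D_\mu\Phi$ is proportional to $Z_\mu$ alone. The sketch below checks both statements numerically. It is an illustration only: the coupling values and field amplitudes are arbitrary numbers chosen here, and a single field component stands in for the four-vector.

```python
import math

def weinberg_cos_sin(g1, g2):
    """cos(theta_w) and sin(theta_w) from the two couplings, as in (11.27)."""
    norm = math.hypot(g1, g2)
    return g2 / norm, g1 / norm

def to_mass_basis(b, w3, g1, g2):
    """(B, W3) -> (A, Z): Z = W3 cos - B sin, A = W3 sin + B cos."""
    c, s = weinberg_cos_sin(g1, g2)
    return w3 * s + b * c, w3 * c - b * s

def to_gauge_basis(a, z, g1, g2):
    """Inverse rotation (A, Z) -> (B, W3): B = A cos - Z sin, W3 = A sin + Z cos."""
    c, s = weinberg_cos_sin(g1, g2)
    return a * c - z * s, a * s + z * c

# Arbitrary illustrative numbers (one component of each field).
g1, g2 = 0.35, 0.65
b, w3 = 1.7, -0.4

a, z = to_mass_basis(b, w3, g1, g2)
b_back, w3_back = to_gauge_basis(a, z, g1, g2)

# The quadratic form multiplying (phi_0 + h/sqrt(2))^2 in L_Phi:
# (g2 W3 - g1 B)^2 should equal (g1^2 + g2^2) Z^2, with no A dependence.
lhs = (g2 * w3 - g1 * b) ** 2
rhs = (g1 ** 2 + g2 ** 2) * z ** 2
```

Because the field $A_\mu$ drops out of this quadratic form, it acquires no mass term, which is the algebraic reason it can later be identified with the photon.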

11.4 Identification of the fields 

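We are now in a position to rearrange the terms in the full Lagrangian density 
$\mathcal{L} = \mathcal{L}_\Phi + \mathcal{L}_{\rm dyn}$ to reveal its physical content. In $\mathcal{L}_{\rm dyn}$ (equation (11.20)) we use 
(11.29) and (11.30) to express the fields $B_\mu$ and $W^3_\mu$ in terms of the fields $A_\mu$ and 
$Z_\mu$, and then we may write 

$$\mathcal{L} = \mathcal{L}_1 + \mathcal{L}_2$$

where 

$$\begin{aligned}
\mathcal{L}_1 ={}& \tfrac{1}{2}\partial_\mu h\,\partial^\mu h - m^2h^2 \\
& - \tfrac{1}{4}Z_{\mu\nu}Z^{\mu\nu} + \tfrac{1}{4}\phi_0^2(g_1^2 + g_2^2)Z_\mu Z^\mu \\
& - \tfrac{1}{4}A_{\mu\nu}A^{\mu\nu} \\
& - \tfrac{1}{2}[(D_\mu W^+_\nu)^* - (D_\nu W^+_\mu)^*][D^\mu W^{+\nu} - D^\nu W^{+\mu}] + \tfrac{1}{2}g_2^2\phi_0^2\,W^-_\mu W^{+\mu}
\end{aligned} \tag{11.31}$$

and $D_\mu W^+_\nu = (\partial_\mu + ig_2\sin\theta_w A_\mu)W^+_\nu$. 

$\mathcal{L}_1$ is relatively simple: you will recognise it as the Lagrangian density for a 
free massive neutral scalar boson field $h(x)$, a free massive neutral vector boson 
field $Z_\mu(x)$, and a pair of massive charged vector boson fields $W^+_\mu(x)$ and $W^-_\mu(x)$, 
interacting with the electromagnetic field $A_\mu(x)$. 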

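$\mathcal{L}_2$ is the sum of the remaining interaction terms. As the patient reader may 
verify, 

$$\begin{aligned}
\mathcal{L}_2 ={}& \left(\tfrac{1}{4}h^2 + \tfrac{1}{\sqrt{2}}h\phi_0\right)\left(g_2^2\,W^-_\mu W^{+\mu} + \tfrac{1}{2}(g_1^2 + g_2^2)Z_\mu Z^\mu\right) \\
& - \frac{m^2h^3}{\sqrt{2}\,\phi_0} - \frac{m^2h^4}{8\phi_0^2} + \frac{g_2^2}{4}(W^-_\mu W^+_\nu - W^-_\nu W^+_\mu)(W^{-\mu}W^{+\nu} - W^{-\nu}W^{+\mu}) \\
& + \frac{ig_2}{2}(A_{\mu\nu}\sin\theta_w + Z_{\mu\nu}\cos\theta_w)(W^{-\mu}W^{+\nu} - W^{-\nu}W^{+\mu}) \\
& - g_2^2\cos^2\theta_w\,(Z_\mu Z^\mu W^-_\nu W^{+\nu} - Z_\mu Z^\nu W^-_\nu W^{+\mu}) \\
& + \frac{ig_2}{2}\cos\theta_w\,[(Z_\mu W^-_\nu - Z_\nu W^-_\mu)(D^\mu W^{+\nu} - D^\nu W^{+\mu}) \\
& \qquad - (Z_\mu W^+_\nu - Z_\nu W^+_\mu)((D^\mu W^{+\nu})^* - (D^\nu W^{+\mu})^*)].
\end{aligned} \tag{11.32}$$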

Most of the $U(1) \times SU(2)$ symmetry with which we began has been lost on 
symmetry breaking. In particular, no trace of the original $SU(2)$ symmetry is to be 
seen in the interactions described by $\mathcal{L}_2$. Nevertheless it is precisely this complicated 
set of interactions that makes the theory renormalisable, as it would be if the 
symmetry were not broken. 

We identify the three vector fields, $W^+_\mu$, $W^-_\mu$, $Z_\mu$, with the mediators of the 
weak interaction, the $W^+$, $W^-$, $Z$ particles, which, subsequent to the theory, were 
discovered experimentally. The masses are (Particle Data Group, 2004) 

$$M_w = 80.425 \pm 0.038\ \text{GeV}, \tag{11.33}$$

$$M_z = 91.1876 \pm 0.0021\ \text{GeV}. \tag{11.34}$$

From (11.31) and Section 4.9, we identify 

$$\phi_0\,g_2/\sqrt{2} = M_w, \tag{11.35}$$

$$\phi_0(g_1^2 + g_2^2)^{1/2}/\sqrt{2} = M_z. \tag{11.36}$$

Then, from (11.27), and neglecting quantum corrections to the mass ratio, 

$$\cos\theta_w = M_w/M_z = 0.8810 \pm 0.0016. \tag{11.37a}$$


It is usual to quote the value of $\sin^2\theta_w$, which will appear in later calculations. 
The estimate above would suggest 

$$\sin^2\theta_w = 0.23120 \pm 0.00015.$$

The uncertainty arises mainly from uncertainty in $M_w$. Other ways of estimating 
$\sin^2\theta_w$ exist and the accepted value (in 1996) was 

$$\sin^2\theta_w = 0.2315 \pm 0.0004. \tag{11.37b}$$

We shall adopt this value in subsequent calculations. 
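
The tree-level relation $\cos\theta_w = M_w/M_z$ can be evaluated directly from quoted masses. The sketch below is only a rough numerical illustration and makes two assumptions that are not in the text: the two mass uncertainties are treated as uncorrelated, and the second pair of masses is the older pair ($M_w = 80.33$ GeV, $M_z = 91.187$ GeV) used in Section 12.2.

```python
import math

def cos_theta_w(m_w, dm_w, m_z, dm_z):
    """Tree-level cos(theta_w) = M_w / M_z with a simple uncorrelated error estimate."""
    ratio = m_w / m_z
    error = ratio * math.sqrt((dm_w / m_w) ** 2 + (dm_z / m_z) ** 2)
    return ratio, error

# Masses quoted in (11.33) and (11.34), in GeV.
cos_w, d_cos_w = cos_theta_w(80.425, 0.038, 91.1876, 0.0021)
sin2_w = 1.0 - cos_w ** 2
d_sin2_w = 2.0 * cos_w * d_cos_w  # first-order propagation of the error on cos

# Older mass values used in Section 12.2 (errors not needed here).
cos_w_old, _ = cos_theta_w(80.33, 0.0, 91.187, 0.0)
sin2_w_old = 1.0 - cos_w_old ** 2
```

With the masses of (11.33) and (11.34) the ratio comes out near 0.882 and $1 - M_w^2/M_z^2$ near 0.222, while the older masses give a ratio close to the 0.8810 quoted in (11.37a). Values of $\sin^2\theta_w$ near 0.231 are not reproduced by this naive mass ratio; that gap is of the size one would attribute to the neglected quantum corrections and to how $\sin^2\theta_w$ is defined, which is why the text adopts (11.37b) rather than the mass-ratio estimate.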

The $W^\pm$ bosons are found experimentally to carry charge $\pm e$. In (11.31) the 
gauge derivative is 

$$D_\mu W^\pm_\nu = (\partial_\mu \pm ig_2\sin\theta_w A_\mu)W^\pm_\nu,$$

so that from the coupling to the electromagnetic field $A_\mu$ and (11.27) we can 
identify 

$$e = g_2\sin\theta_w = g_1\cos\theta_w. \tag{11.38}$$
The fields $W^+_\mu$, $W^-_\mu$, and $Z_\mu$ have free field expansions similar to (4.15) but with 
three polarisation states (see Section 4.9). As a quantum field $W^+_\mu$ destroys $W^+$ 
bosons and creates $W^-$ bosons; $W^-_\mu$ destroys $W^-$ bosons and creates $W^+$ bosons. 

There remains the scalar Higgs field $h(x)$. The vacuum state expectation value 
$\phi_0$ of the Higgs field is, from (11.35), 

$$\phi_0 = \frac{\sqrt{2}\,M_w}{g_2} = \frac{\sqrt{2}\,M_w\sin\theta_w}{e} = 180\ \text{GeV}. \tag{11.39}$$
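
The number in (11.39) follows from (11.35) and (11.38) once a value of $e$ is supplied. The sketch below assumes natural units in which $e^2 = 4\pi\alpha$ with $\alpha \approx 1/137.036$; that convention and the value of $\alpha$ are assumptions made here for illustration, since they are not stated in this section.

```python
import math

ALPHA = 1.0 / 137.036   # fine-structure constant (assumed value)
M_W = 80.425            # GeV, from (11.33)
SIN2_THETA_W = 0.2315   # from (11.37b)

# Assumed convention: e^2 = 4 pi alpha in natural units.
e = math.sqrt(4.0 * math.pi * ALPHA)

# (11.38): e = g2 sin(theta_w), so g2 = e / sin(theta_w).
g2 = e / math.sqrt(SIN2_THETA_W)

# (11.39): phi_0 = sqrt(2) M_w / g2.
phi0 = math.sqrt(2.0) * M_W / g2
```

Under these assumptions `phi0` comes out a little above 180 GeV, consistent with the rounded value quoted in (11.39).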


The only parameter not fixed from experiment is the mass $M_H = \sqrt{2}\,m$ of the 
Higgs boson. No Higgs boson has yet been identified experimentally, though 
its existence is, apparently, an essential part of the Standard Model. The failure 
so far of experimental searches to find the Higgs boson suggests $M_H > 
64$ GeV. Recent experimental and theoretical studies suggest an $M_H$ close to this 
limit. 

The requirements of $U(1)$ and $SU(2)$ symmetry, followed by $SU(2)$ symmetry 
breaking, have generated the electromagnetic field, the massive vector $W^\pm$ and $Z$ 
boson fields, and the scalar Higgs field, in a remarkably economical way. In the next 
chapter, we add lepton fermion fields to these boson fields, to obtain the richness 
of the Weinberg-Salam electroweak theory. 


Problems 

11.1 Show that the $W'_\mu$ defined by (11.6) are Hermitian and have zero trace. (Use the 
expression (B.9) of Appendix B: $U = \cos\alpha\,\mathbf{1} + i\sin\alpha\,(\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{\tau})$.) 

11.2 Verify that the expressions (11.13) and (11.16) for $\mathcal{L}_{\rm dyn}$ are equivalent. 

11.3 Verify that the last two terms on the right-hand side of (11.31) correspond to a pair 
of massive charged vector boson fields. 

11.4 Show that the Higgs boson can decay to two photons, in the third order of perturbation 
theory. Draw the appropriate Feynman graph. 

11.5 Under an $SU(2)$ transformation, $\Phi \to \Phi'$ where 





Using (B.9), show that $\tau^2U^* = U\tau^2$. Hence show that 
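
11.6 Show that the $SU(2)$ matrix $U = e^{i\boldsymbol{\alpha}\cdot\boldsymbol{\tau}}$ with $\boldsymbol{\alpha} = \alpha(\sin\phi, \cos\phi, 0)$ is 

$$U = \begin{pmatrix} \cos\alpha & e^{i\phi}\sin\alpha \\ -e^{-i\phi}\sin\alpha & \cos\alpha \end{pmatrix}.$$

Show that under the $SU(2)$ transformation $\Phi' = U\Phi$, the two-component complex 
field 

$$\Phi = \begin{pmatrix} a\,e^{i\delta} \\ b\,e^{i\gamma} \end{pmatrix}$$

can be put in the form 

$$\Phi' = \begin{pmatrix} 0 \\ (a^2 + b^2)^{1/2}\,e^{i\gamma} \end{pmatrix},$$

taking $\phi = (\delta - \gamma)$ and $\alpha = -\tan^{-1}(a/b)$. Show that $\Phi'$ can then be put in the 
standard form (11.23) by a further $SU(2)$ transformation with $\boldsymbol{\alpha} = \gamma(0, 0, 1)$. 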





12 

The Weinberg-Salam electroweak theory for leptons 


We shall now couple the lepton fields to all the gauge boson fields: the electromagnetic 
field, the $W^+$ and $W^-$ fields, and the $Z$ field. We know that at low energies 
the theory must reproduce the phenomenology of Chapter 9. This consideration 
and the principles of $U(1) \times SU(2)$ local gauge symmetry determine the couplings 
uniquely. 

We have seen how the Higgs mechanism gives mass to the $W^\pm$ and $Z$ bosons. To 
give mass to the charged leptons (the electron, the muon and the tau), they too must be 
coupled to the Higgs field. We shall finally arrive at the Weinberg-Salam unified 
theory of the electroweak interaction. 


12.1 Lepton doublets and the Weinberg-Salam theory 


We shall first construct a Lagrangian density for lepton fields that is invariant under 
$U(1)$ and $SU(2)$ transformations. The left-handed electron spinor $e_L$ and the electron 
neutrino spinor $\nu_{eL}$ are put together in an $SU(2)$ doublet, like the Higgs fields in 
equation (11.1), 

$$L = \begin{pmatrix} \nu_{eL} \\ e_L \end{pmatrix}. \tag{12.1}$$

We are now again specialising our notation; two-component left-handed and right-handed 
spinors were denoted by $\psi_L$ and $\psi_R$, respectively, in Chapter 6. Under an 
$SU(2)$ transformation, this doublet transforms in exactly the same way as the Higgs 
doublet: 

$$L \to L' = UL. \tag{12.2}$$

Since $SU(2)$ transformations mix the two spinor fields making up the doublet, 
to maintain Lorentz invariance only fields with the same Lorentz transformation 
properties can be combined together into a doublet. 

From the phenomenology of Chapter 9 the right-handed lepton fields do 
not couple to the $W$ boson field so that $e_R$ and $\nu_{eR}$ are invariant under $SU(2)$ 
transformations: 

$$e_R \to e'_R = e_R, \qquad \nu_{eR} \to \nu'_{eR} = \nu_{eR}. \tag{12.3}$$

To be consistent with the transformation rule (12.2), all $SU(2)$ gauge derivatives 
must be of the same form, $\partial_\mu + i(g_2/2)W_\mu$, where $g_2\sin\theta_w = e$, as in (11.8) and 
(11.38). This is a consequence of the non-Abelian nature of the group $SU(2)$. 
However, there is no similar constraint on the coupling constant to the $U(1)$ gauge 
field $B_\mu$. (See Problem 12.1.) We may take 

$$D_\mu L = [\partial_\mu + i(g_2/2)W_\mu + i(g'/2)B_\mu]L, \tag{12.4}$$

where $g'$ remains at our disposal. We must choose $g'$ so that the neutrino is neutral 
and the electron has charge $-e$. The terms in $D_\mu L$ which couple to the electromagnetic 
field $A_\mu$ are linear combinations of $W^3_\mu$ and $B_\mu$. Using (11.7) and (11.29) the 
terms in $A_\mu$ are 

$$\begin{pmatrix} \partial_\mu + \{i(g_2/2)\sin\theta_w + i(g'/2)\cos\theta_w\}A_\mu & 0 \\ 0 & \partial_\mu + \{-i(g_2/2)\sin\theta_w + i(g'/2)\cos\theta_w\}A_\mu \end{pmatrix}\begin{pmatrix} \nu_{eL} \\ e_L \end{pmatrix}.$$

The gauge derivatives $\partial_\mu\nu_{eL}$ and $(\partial_\mu - ieA_\mu)e_L$, which leave the neutrino electrically 
neutral but impart electric charge $-e$ to $e_L$, are obtained with the choice 

$$g'\cos\theta_w = -g_2\sin\theta_w = -e.$$

The complete gauge derivative of the left-handed fields is then 

$$D_\mu L = \begin{pmatrix} \partial_\mu + i\{e/\sin(2\theta_w)\}Z_\mu & i\{e/(\sqrt{2}\sin\theta_w)\}W^+_\mu \\ i\{e/(\sqrt{2}\sin\theta_w)\}W^-_\mu & \partial_\mu - ieA_\mu - ie\cot(2\theta_w)Z_\mu \end{pmatrix}\begin{pmatrix} \nu_{eL} \\ e_L \end{pmatrix} \tag{12.5}$$

where we have used (11.7), (11.17) and (11.29). 

The gauge derivative of $e_R$ must be of the form 

$$D_\mu e_R = [\partial_\mu + i(g''/2)B_\mu]e_R. \tag{12.6a}$$

Since the electron has charge $-e$ we take $g'' = -2e/\cos\theta_w = -2g_1$ (see (11.38)) 
so that, using (11.29) again, 

$$D_\mu e_R = [\partial_\mu - ieA_\mu + ie\tan\theta_w Z_\mu]e_R. \tag{12.6b}$$

With $g'' = -2g_1$ and $g' = -g_1$, it can easily be checked that, under a local 
$U(1) \times SU(2)$ transformation 

$$L \to L' = e^{i\theta(x)}\,U(x)L, \qquad e_R \to e'_R = e^{2i\theta(x)}\,e_R,$$

the gauge derivatives satisfy 

$$D'_\mu L' = (\partial_\mu + i(g_2/2)W'_\mu + i(g'/2)B'_\mu)L' = e^{i\theta}\,UD_\mu L,$$

$$D'_\mu e'_R = (\partial_\mu + i(g''/2)B'_\mu)e'_R = e^{2i\theta}D_\mu e_R,$$

where the fields $B_\mu$ and $W_\mu$ transform as in (11.4b) and (11.6). 

We can now construct a gauge invariant and Lorentz invariant expression for the 
dynamical part of the Lagrangian density for the electron and the electron neutrino: 

$$\mathcal{L}^e_{\rm dyn} = L^\dagger\tilde{\sigma}^\mu iD_\mu L + e_R^\dagger\sigma^\mu iD_\mu e_R + \nu_{eR}^\dagger\sigma^\mu i\partial_\mu\nu_{eR}. \tag{12.7}$$

The gauge invariance follows from our construction of the gauge derivatives, and 
the Lorentz invariance from the spinor properties set out in Section 5.4. (Remember 
that the $\sigma^\mu$ matrices act on the spinor indices, whereas the $SU(2)$ transformation 
acts independently on the components of the doublet of spinor fields.) Note that 
besides the interaction with the electromagnetic field we have fully determined, 
from the factor $D_\mu L$, all the interactions with the heavy vector bosons. 

Finally, we must give mass to the charged leptons. A gauge and Lorentz invariant 
contribution to the Lagrangian density that will impart mass to the electron but leave 
the neutrino massless is (neutrino mass will be introduced in Chapter 19) 

$$\begin{aligned}
\mathcal{L}^e_{\rm mass} &= -c_e[(L^\dagger\Phi)e_R + e_R^\dagger(\Phi^\dagger L)] \\
&= -c_e[(\nu_L^\dagger\Phi_A + e_L^\dagger\Phi_B)e_R + e_R^\dagger(\Phi_A^\dagger\nu_L + \Phi_B^\dagger e_L)],
\end{aligned} \tag{12.8}$$

where $\Phi$ is the Higgs doublet field and $c_e$ is a dimensionless coupling constant. 
After symmetry breaking (see (11.23)), $\mathcal{L}^e_{\rm mass}$ becomes 

$$\mathcal{L}^e_{\rm mass} = -c_e\phi_0(e_L^\dagger e_R + e_R^\dagger e_L) - \frac{c_e h}{\sqrt{2}}(e_L^\dagger e_R + e_R^\dagger e_L). \tag{12.9}$$

Comparing this with the Dirac Lagrangian density (5.12), we identify $c_e\phi_0$ with 
the electron mass $m_e$. Introducing mass by following the principles of symmetry 
has left us no option but to introduce an interaction between the electron field and 
the Higgs field $h(x)$. Hence the coupling constant to the Higgs field is 

$$\frac{c_e}{\sqrt{2}} = \frac{m_e}{\sqrt{2}\,\phi_0} = 2.01 \times 10^{-6} \tag{12.10}$$

(using (11.39)). It is just as well that $c_e$ is small: we do not want this term to upset 
the calculations of QED! 
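
The size of the Higgs coupling follows from the lepton mass and $\phi_0$ alone, so the same one-line formula can be evaluated for each charged lepton. In the sketch below $\phi_0 = 180$ GeV is taken from (11.39); the lepton masses are standard approximate values supplied here as assumptions, since they are not listed in this section.

```python
import math

PHI0_GEV = 180.0  # Higgs vacuum expectation value, from (11.39)

# Approximate charged-lepton masses in GeV (assumed values, not from this section).
LEPTON_MASS_GEV = {"e": 0.000511, "mu": 0.10566, "tau": 1.777}

def higgs_coupling(mass_gev, phi0_gev=PHI0_GEV):
    """Coupling c/sqrt(2) = m / (sqrt(2) phi_0), following (12.10)."""
    return mass_gev / (math.sqrt(2.0) * phi0_gev)

couplings = {name: higgs_coupling(m) for name, m in LEPTON_MASS_GEV.items()}
```

For the electron this gives a value close to the $2.01 \times 10^{-6}$ of (12.10). Because the coupling is proportional to the mass, the muon and tau couplings are larger by the corresponding mass ratios.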

The total Lagrangian density $\mathcal{L}^e$ for the electron and its neutrino is given by 
(12.7) and (12.8): 

$$\mathcal{L}^e = \mathcal{L}^e_{\rm dyn} + \mathcal{L}^e_{\rm mass}. \tag{12.11}$$

From $\mathcal{L}^e$ we can pick out the terms 

$$\begin{aligned}
\mathcal{L}_{\rm Dirac} ={}& \nu_{eL}^\dagger\tilde{\sigma}^\mu i\partial_\mu\nu_{eL} + e_L^\dagger\tilde{\sigma}^\mu i(\partial_\mu - ieA_\mu)e_L + \nu_{eR}^\dagger\sigma^\mu i\partial_\mu\nu_{eR} \\
& + e_R^\dagger\sigma^\mu i(\partial_\mu - ieA_\mu)e_R - m_e(e_L^\dagger e_R + e_R^\dagger e_L),
\end{aligned} \tag{12.12}$$

which correspond to the expressions we found in Chapter 6 and Chapter 7 for a 
Dirac massless neutrino, and a Dirac electron of mass $m_e$ and charge $-e$ in an 
electromagnetic field. 

The Lagrangian densities $\mathcal{L}^\mu$ and $\mathcal{L}^\tau$ for the muon and tau leptons and their 
neutrinos differ from (12.11) only in their mass parameters and, hence, their couplings 
to the Higgs field: 

$$\frac{c_\mu}{\sqrt{2}} = \frac{m_\mu}{\sqrt{2}\,\phi_0} = 4.15 \times 10^{-4}, \qquad \frac{c_\tau}{\sqrt{2}} = \frac{m_\tau}{\sqrt{2}\,\phi_0} = 6.98 \times 10^{-3}. \tag{12.13}$$

The coupling constant $g_2$ of the $SU(2)$ gauge theory, or, equivalently, the Weinberg 
angle $\theta_w$ (see (11.38)), which determines the coupling to the $W^\pm$ and $Z$ fields, must 
be the same for all leptons, a feature of the theory that is forced on us by the $SU(2)$ 
group, and that is known as lepton universality. 

The complete Lagrangian density $\mathcal{L}_{\rm ws}$ of the Weinberg-Salam theory (Weinberg, 
1967; Salam, 1968) is the sum of the lepton contributions, and the boson 
contributions given by (11.31) and (11.32): 

$$\mathcal{L}_{\rm ws} = \mathcal{L}^e + \mathcal{L}^\mu + \mathcal{L}^\tau + \mathcal{L}_{\rm bosons}. \tag{12.14}$$

The form of $\mathcal{L}_{\rm ws}$ has been determined by considerations of symmetry: invariance 
under Lorentz transformations, and under $U(1)$ and $SU(2)$ transformations. Massive 
bosons and leptons appear through the Higgs mechanism of local symmetry breaking. 
It has been proved by 't Hooft (1976), who introduced radically new methods 
of analysis, that the theory is renormalisable. We shall see in Chapter 13 that there 
is a great body of data that supports it. 


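12.2 Lepton coupling to the $W^\pm$ 

The coupling of the electron and the electron neutrino to the $W^+$ and $W^-$ gauge 
fields is given by the appropriate terms in (12.5) and (12.7), which are 

$$\begin{aligned}
\mathcal{L}_{eW} &= -(g_2/\sqrt{2})\,\nu_{eL}^\dagger\tilde{\sigma}^\mu e_L\,W^+_\mu - (g_2/\sqrt{2})\,e_L^\dagger\tilde{\sigma}^\mu\nu_{eL}\,W^-_\mu \\
&= -(g_2/\sqrt{2})[j_e^{\mu\dagger}W^+_\mu + j_e^\mu W^-_\mu].
\end{aligned} \tag{12.15}$$

The right-handed fields do not contribute to this interaction. As in Chapter 9 the 
currents are defined as 

$$j_e^\mu = e_L^\dagger\tilde{\sigma}^\mu\nu_{eL}, \qquad j_e^{\mu\dagger} = \nu_{eL}^\dagger\tilde{\sigma}^\mu e_L. \tag{12.16}$$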

There are similar muon and tau currents, giving a total lepton current 

$$j^\mu = e_L^\dagger\tilde{\sigma}^\mu\nu_{eL} + \mu_L^\dagger\tilde{\sigma}^\mu\nu_{\mu L} + \tau_L^\dagger\tilde{\sigma}^\mu\nu_{\tau L} \tag{12.17}$$

and total interaction Lagrangian density 

$$\mathcal{L}_{lW} = -(g_2/\sqrt{2})[j^{\mu\dagger}W^+_\mu + j^\mu W^-_\mu]. \tag{12.18}$$


The effective $\mathcal{L}_{\rm lepton}$ used in the discussion of muon decay in Section 9.4 can be 
obtained as the low energy limit of the Weinberg-Salam theory. Since the mass 
$M_w$ is so large, at low energies the term $M_w^2W^-_\mu W^{+\mu}$ in (11.31) dominates in the 
$W$ contribution to the Lagrangian density, and 

$$\mathcal{L}_W \approx M_w^2\,W^-_\mu W^{+\mu} - (g_2/\sqrt{2})[j^{\mu\dagger}W^+_\mu + j^\mu W^-_\mu]. \tag{12.19}$$

Physical field configurations correspond to stationary values of the action. Varying 
$W^+_\mu$ and $W^-_\mu$ independently gives the field equations 

$$M_w^2\,W^-_\mu = (g_2/\sqrt{2})\,j^\dagger_\mu, \qquad M_w^2\,W^+_\mu = (g_2/\sqrt{2})\,j_\mu, \tag{12.20}$$

and using these in (12.19) gives 

$$\mathcal{L}_W \approx -\frac{g_2^2}{2M_w^2}\,j^\dagger_\mu j^\mu. \tag{12.21}$$


$\mathcal{L}_W$ is equivalent to the effective $\mathcal{L}_{\rm lepton}$ of (9.8) if we make the identification 

$$G_F = \frac{g_2^2}{4\sqrt{2}\,M_w^2} = \frac{e^2}{4\sqrt{2}\,M_w^2\sin^2\theta_w}. \tag{12.22}$$

Taking $M_w = 80.33$ GeV, $M_z = 91.187$ GeV, $\sin^2\theta_w = 1 - M_w^2/M_z^2$, gives $G_F = 
1.12 \times 10^{-5}$ GeV$^{-2}$, which is in good agreement with the accepted experimental 
value, $1.166 \times 10^{-5}$ GeV$^{-2}$. Historically, the knowledge of $G_F$, together with an 
estimate of $\theta_w$ (see Section 13.1) was used to predict the masses of the $W^\pm$ and $Z$ 
bosons, and the CERN proton-antiproton collider was then built to find them. 
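
The estimate of the Fermi constant quoted above can be reproduced from (12.22) in a few lines. The sketch assumes natural units with $e^2 = 4\pi\alpha$ and $\alpha \approx 1/137.036$; both the convention and the numerical value of $\alpha$ are assumptions made here, as this section does not state them.

```python
import math

ALPHA = 1.0 / 137.036      # fine-structure constant (assumed value)
M_W, M_Z = 80.33, 91.187   # GeV, the values used in the text for this estimate

# Tree-level definition used in the text: sin^2(theta_w) = 1 - M_w^2 / M_z^2.
sin2_theta_w = 1.0 - (M_W / M_Z) ** 2

# Assumed convention: e^2 = 4 pi alpha.
e_squared = 4.0 * math.pi * ALPHA

# (12.22): G_F = e^2 / (4 sqrt(2) M_w^2 sin^2(theta_w)), in GeV^-2.
g_fermi = e_squared / (4.0 * math.sqrt(2.0) * M_W ** 2 * sin2_theta_w)

# Fractional difference from the experimental value quoted in the text.
G_FERMI_EXPERIMENT = 1.166e-5
relative_gap = (G_FERMI_EXPERIMENT - g_fermi) / G_FERMI_EXPERIMENT
```

Under these assumptions the result is close to $1.12 \times 10^{-5}$ GeV$^{-2}$, a few per cent below the experimental value, as one would expect of a lowest-order estimate.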


12.3 Lepton coupling to the Z 


The coupling of the leptons to the $Z$ field can be extracted from the terms involving 
$Z_\mu$ in (12.7): 

$$\begin{aligned}
\mathcal{L}_{eZ} &= -\frac{e}{\sin(2\theta_w)}\,\nu_{eL}^\dagger\tilde{\sigma}^\mu\nu_{eL}\,Z_\mu + \frac{e\cos(2\theta_w)}{\sin(2\theta_w)}\,e_L^\dagger\tilde{\sigma}^\mu e_L\,Z_\mu \\
&\quad - e_R^\dagger\sigma^\mu e_R\,(e\tan\theta_w)Z_\mu \qquad \text{(using (12.5) and (12.6b))} \\
&= -\frac{e}{\sin(2\theta_w)}\,(j_{\rm neutral})^\mu Z_\mu
\end{aligned}$$

where 

$$(j_{\rm neutral})^\mu = \nu_{eL}^\dagger\tilde{\sigma}^\mu\nu_{eL} - \cos(2\theta_w)\,e_L^\dagger\tilde{\sigma}^\mu e_L + 2\sin^2\theta_w\,e_R^\dagger\sigma^\mu e_R. \tag{12.23}$$

There are similar expressions for $\mathcal{L}_{\mu Z}$ and $\mathcal{L}_{\tau Z}$. Note that the right-handed charged 
lepton fields also couple to the $Z$ field but not the right-handed neutrino. 

The low energy limit of $\mathcal{L}_Z$ may be obtained in the same way as we obtained 
the low energy limit $\mathcal{L}_W$ in Section 12.2, with the same identification of coupling 
constants, and is identical with the effective Lagrangian density (9.15) if, comparing 
(12.23) with (9.17), 

$$c_A = -\tfrac{1}{2}, \qquad c_V = -\tfrac{1}{2} + 2\sin^2\theta_w. \tag{12.24}$$
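
The step from the gauge derivatives (12.5) and (12.6b) to the neutral current weights in (12.23) rests on two trigonometric identities: $\cot(2\theta_w)\sin(2\theta_w) = \cos(2\theta_w)$ and $\tan\theta_w\sin(2\theta_w) = 2\sin^2\theta_w$. The sketch below checks them numerically for $\sin^2\theta_w = 0.2315$. The last two lines assume that $c_V$ and $c_A$ are half the sum and half the difference of the left-handed and right-handed electron weights; that reading of (9.17), which is not reproduced in this section, is an assumption.

```python
import math

SIN2_THETA_W = 0.2315  # value adopted in (11.37b)
theta_w = math.asin(math.sqrt(SIN2_THETA_W))

# Coefficients of Z_mu in the gauge derivatives, in units of i*e:
z_coeff_nu = 1.0 / math.sin(2.0 * theta_w)   # neutrino entry of (12.5)
z_coeff_eL = -1.0 / math.tan(2.0 * theta_w)  # electron entry of (12.5): -cot(2 theta_w)
z_coeff_eR = math.tan(theta_w)               # (12.6b)

# Factor out 1/sin(2 theta_w), as in L_eZ = -(e / sin 2theta_w) j_neutral . Z.
# The overall sign is chosen so that the neutrino weight is +1.
weight_nu = z_coeff_nu * math.sin(2.0 * theta_w)
weight_eL = z_coeff_eL * math.sin(2.0 * theta_w)
weight_eR = z_coeff_eR * math.sin(2.0 * theta_w)

# Assumed mapping to the vector and axial couplings of (12.24).
c_V = 0.5 * (weight_eL + weight_eR)
c_A = 0.5 * (weight_eL - weight_eR)
```

With this mapping `c_A` is $-1/2$ and `c_V` is $-1/2 + 2\sin^2\theta_w$, a small negative number for the adopted value of $\sin^2\theta_w$.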


The low energy muon neutrino-electron elastic scattering cross-sections 
calculated from the effective Lagrangian density are 

$$\sigma(\nu_\mu + e^- \to \nu_\mu + e^-) = \frac{G_F^2\,s}{\pi}\left[\tfrac{4}{3}\sin^4\theta_w - \sin^2\theta_w + \tfrac{1}{4}\right], \tag{12.25}$$

$$\sigma(\bar{\nu}_\mu + e^- \to \bar{\nu}_\mu + e^-) = \frac{G_F^2\,s}{\pi}\left[\tfrac{4}{3}\sin^4\theta_w - \tfrac{1}{3}\sin^2\theta_w + \tfrac{1}{12}\right], \tag{12.26}$$

where $s$ is the square of the centre of mass energy and $E_\nu \gg m_e$ (see Perkins, 1987, 
p. 327). 

These low energy ($\ll M_z, M_w$) cross-sections have been measured at CERN 
(CHARM II Collaboration, 1994), and their ratio yields an estimate for $\sin^2\theta_w = 
0.2324 \pm 0.0083$. 

The Fermi constant $G_F$ is also known experimentally from low energy 
phenomena, and $e$ is of course well known. Hence within the framework of the 
Weinberg-Salam theory the masses of the $Z$ and $W^\pm$ gauge bosons can be estimated from low 
energy data alone, using (12.22) and (11.37). (Earlier estimates of $\sin^2\theta_w$ came 
from neutrino-nuclear scattering.) 


12.4 Conservation of lepton number and conservation of charge 

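The Weinberg-Salam Lagrangian density $\mathcal{L}_{\rm ws}$ has also further independent global 
$U(1)$ symmetries. It is invariant under the $U(1)$ transformation $L_e \to e^{i\alpha}L_e$, $e_R \to 
e^{i\alpha}e_R$, where $\alpha$ is a constant phase (see (12.7) and (12.9)). Using the device (by 
now familiar) of varying $\alpha$ so that $\alpha \to \alpha + \delta\alpha(x)$, where $\delta\alpha$ is space and time 
dependent, the first-order variation in the action comes from the dynamical part 
$\mathcal{L}^e_{\rm dyn}$ (equation (12.7)), and is 

$$\begin{aligned}
\delta S &= -\int L^\dagger\tilde{\sigma}^\mu L\,\partial_\mu(\delta\alpha)\,d^4x - \int e_R^\dagger\sigma^\mu e_R\,\partial_\mu(\delta\alpha)\,d^4x \\
&= \int [\partial_\mu(L^\dagger\tilde{\sigma}^\mu L) + \partial_\mu(e_R^\dagger\sigma^\mu e_R)](\delta\alpha)\,d^4x,
\end{aligned}$$

on integrating by parts. Setting $\delta S = 0$ for arbitrary $\delta\alpha$ yields 

$$\partial_\mu(\nu_L^\dagger\tilde{\sigma}^\mu\nu_L + e_L^\dagger\tilde{\sigma}^\mu e_L) + \partial_\mu(e_R^\dagger\sigma^\mu e_R) = 0,$$

or 

$$\partial_\mu J_e^\mu = 0, \tag{12.27}$$

where 

$$J_e^0 = \nu_L^\dagger\nu_L + e_L^\dagger e_L + e_R^\dagger e_R, \qquad J_e^i = \nu_L^\dagger\tilde{\sigma}^i\nu_L + e_L^\dagger\tilde{\sigma}^i e_L + e_R^\dagger\sigma^i e_R. \tag{12.28}$$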

Equation (12.27), which with (12.28) we may write as 

$$\frac{\partial J_e^0}{\partial t} + \nabla\cdot\mathbf{J}_e = 0, \tag{12.29}$$

expresses the conservation of electron lepton number. Similar $U(1)$ transformations 
applied to the muon and tau parts of $\mathcal{L}_{\rm ws}$ give the conservation of muon lepton 
number, and tau lepton number. We will see in Chapter 19 that the inclusion of 
Dirac neutrino mass into the Standard Model reduces these three conservation laws 
to one. 

As in Chapters 4 and 5, the inhomogeneous Maxwell equations can be obtained 
by varying $A_\mu$. There are contributions to the electric current from the charged $W^\pm$ 
fields, as well as from the charged leptons. Conservation of charge follows from 
Maxwell’s equations, but can be obtained more directly from the $U(1)$ symmetry 
apparent in each term of the Weinberg-Salam Lagrangian density (12.14): 

$$e_L \to e^{i\alpha}e_L,\ e_R \to e^{i\alpha}e_R;\quad \mu_L \to e^{i\alpha}\mu_L,\ \mu_R \to e^{i\alpha}\mu_R;\quad \tau_L \to e^{i\alpha}\tau_L,\ \tau_R \to e^{i\alpha}\tau_R;\quad W^+_\mu \to e^{-i\alpha}W^+_\mu. \tag{12.30}$$


12.5 CP symmetry 

We saw in Chapter 5 (equation (5.27)) that under space inversion a left-handed 
spinor $\psi_L$ transforms into a right-handed spinor $\psi_R$, and vice versa. The Weinberg-Salam 
Lagrangian does not have space inversion symmetry, since only the left-hand 
components of the lepton wave functions are coupled to the $SU(2)$ gauge field $W_\mu$. 

We also discussed in Chapter 7 the operation of charge conjugation, 


V'L = V^R = i0"V £, 


which relates solutions of the Dirac equation for particles to solutions for 
antiparticles. In the Weinberg-Salam theory there is no charge symmetry. 

The Weinberg-Salam Lagrangian does exhibit a symmetry under the combined 
CP (charge conjugation, parity) operation. This symmetry implies that the physics 
of particles described in a right-handed coordinate system is the same as the physics 
of antiparticles described in a left-handed coordinate system. 

Under the combined CP operation, lepton fields transform according to 



(12.31) 


The other fields in the electroweak theory transform as set out below: 



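$U(1)$ gauge fields: $B_0^{CP} = -B_0$, $B_i^{CP} = B_i$. 

$SU(2)$ gauge fields: 

$$\begin{pmatrix} W^3_0 & W^1_0 - iW^2_0 \\ W^1_0 + iW^2_0 & -W^3_0 \end{pmatrix}^{CP} = -\begin{pmatrix} W^3_0 & W^1_0 + iW^2_0 \\ W^1_0 - iW^2_0 & -W^3_0 \end{pmatrix},$$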

r 'd 



It follows that 



(12.32) 


Space derivatives of fields are replaced by their negatives. 

To show that the Lagrangian density is invariant under these transformations 
requires some care. We demonstrate it here for just one term, but one which involves 
all the necessary steps in the complete argument, and we leave the remaining terms 
to the reader. Consider then the term from the expression (12.7) 

$$e_R^\dagger\sigma^\mu i[\partial_\mu + i(g''/2)B_\mu]e_R = l, \text{ say}.$$

Under the CP transformation, 

$$l^{CP} = e_R^{\rm T}(\sigma^\mu)^{\rm T}\,i[\partial_\mu - i(g''/2)B_\mu]e_R^*,$$

where we have used the results 

$$(\sigma^2)^2 = 1, \qquad \sigma^2\sigma^i\sigma^2 = -(\sigma^i)^{\rm T}.$$

The operators $\partial_\mu$ now act on the conjugate fields. In fact $l^{CP}$ is not identical to $l$, but 
differs from it only by a sum of total derivatives and, as explained in Section 3.1, a 
total derivative is of no consequence. If we add to $l^{CP}$ the terms $-i\partial_\mu[e_R^{\rm T}(\sigma^\mu)^{\rm T}e_R^*]$ 
we obtain 

$$-i(\partial_\mu e_R^{\rm T})(\sigma^\mu)^{\rm T}e_R^* + (g''/2)B_\mu\,e_R^{\rm T}(\sigma^\mu)^{\rm T}e_R^*.$$

Transposing this expression introduces another minus sign, since $e_R$ and $e_R^*$ are 
fermion fields and hence anticommute. We then recover $l$. 


12.6 Mass terms in $\mathcal{L}$: an attempted generalisation 

For later use, when the theory is extended to quarks, we finish this chapter by 
contemplating a possible generalisation of our Lagrangian density. The coupling 
of the three lepton families to the Higgs field was taken to be 

$$\mathcal{L}_{\rm mass} = -\sum_{i=1}^{3} c_i[(L_i^\dagger\Phi)r_i + r_i^\dagger(\Phi^\dagger L_i)],$$

where the sum is over the three lepton families, and we have modified the notation 
of (12.8) in an obvious way. We might have taken a more general coupling, 

$$\mathcal{L}_{\rm mass} = -\sum_{i,j}[G_{ij}(L_i^\dagger\Phi)r_j + G_{ij}^*\,r_j^\dagger(\Phi^\dagger L_i)].$$

This preserves the $U(1) \times SU(2)$ symmetry with $G_{ij}$ any $3 \times 3$ complex matrix. 

We wish to show that this form has no essential difference from that already 
introduced. This is because an arbitrary complex matrix can always be put 
into real diagonal form with the help of two unitary matrices, $U_L$ and $U_R$ 
(Appendix A): 

$$G = U_L^\dagger\,C\,U_R$$

with $C_{ij} = 0$ for $i \neq j$. 

$U_L$ and $U_R$ are in general unique, except that both may be multiplied on the left 
by the same ‘phase factor’ matrix 

$$\begin{pmatrix} e^{i\alpha_1} & 0 & 0 \\ 0 & e^{i\alpha_2} & 0 \\ 0 & 0 & e^{i\alpha_3} \end{pmatrix}.$$

If we define $r'_i = U^R_{ij}r_j$, $L'_i = U^L_{ij}L_j$ we recover the original form for the 
coupling to the Higgs field. Since the dynamical terms in the Lagrangian density 
are of the same form after these unitary transformations (Problem 12.5), the more 
general $\mathcal{L}_{\rm mass}$ is just a more complicated expression of the same physics. The three phase factors 
$\exp(i\alpha_i)$ correspond to the three $U(1)$ symmetries which lead to electron, muon, 
and tau number conservation. 
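
The algebra behind the field redefinition can be illustrated numerically. The sketch below does not decompose a given matrix $G$ (that would need a singular value decomposition, which the Python standard library does not provide). Instead it runs the argument in the reverse direction: it picks two unitary matrices and a real diagonal $C$, builds $G = U_L^\dagger CU_R$, and checks that the general bilinear in the original fields equals the diagonal bilinear in the redefined fields. Ordinary complex numbers stand in for the field components, the Higgs field is dropped, and all names and numerical values are arbitrary choices made here.

```python
import random

def gram_schmidt(rows):
    """Orthonormalise complex row vectors; the rows then form a unitary matrix."""
    basis = []
    for v in rows:
        for b in basis:
            overlap = sum(bj.conjugate() * vj for bj, vj in zip(b, v))
            v = [vj - overlap * bj for vj, bj in zip(v, b)]
        norm = sum(abs(vj) ** 2 for vj in v) ** 0.5
        basis.append([vj / norm for vj in v])
    return basis

def random_complex_vector(rng, n):
    return [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]

def random_unitary(rng, n):
    return gram_schmidt([random_complex_vector(rng, n) for _ in range(n)])

rng = random.Random(0)  # fixed seed so the sketch is reproducible
n = 3
U_L, U_R = random_unitary(rng, n), random_unitary(rng, n)
C = [0.3, 1.1, 2.4]     # real diagonal entries (illustrative)

# G = U_L^dagger C U_R, i.e. G_ij = sum_k conj(U_L[k][i]) C_k U_R[k][j].
G = [[sum(U_L[k][i].conjugate() * C[k] * U_R[k][j] for k in range(n))
      for j in range(n)] for i in range(n)]

# Stand-ins for the field components L_i and r_j.
L = random_complex_vector(rng, n)
r = random_complex_vector(rng, n)

# Redefined fields L'_k = U^L_kj L_j and r'_k = U^R_kj r_j.
L_new = [sum(U_L[k][j] * L[j] for j in range(n)) for k in range(n)]
r_new = [sum(U_R[k][j] * r[j] for j in range(n)) for k in range(n)]

general = sum(G[i][j] * L[i].conjugate() * r[j] for i in range(n) for j in range(n))
diagonal = sum(C[k] * L_new[k].conjugate() * r_new[k] for k in range(n))

# Unitarity of U_L (rows orthonormal), which is what keeps the kinetic terms unchanged.
unitarity_defect = max(
    abs(sum(U_L[a][j] * U_L[b][j].conjugate() for j in range(n)) - (1 if a == b else 0))
    for a in range(n) for b in range(n)
)
```

The equality of `general` and `diagonal` is pure algebra and holds for any matrices $U_L$ and $U_R$. Unitarity matters for the other half of the argument, namely that the dynamical terms keep their form under the same redefinition (Problem 12.5).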


Problems 

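12.1 Set the fields $W_\mu$ to be zero, and consider the dynamical Lagrangian density 

$$\mathcal{L}_1 = L^\dagger\tilde{\sigma}^\mu i(\partial_\mu + i(g'/2)B_\mu)L.$$

With the gauge transformation (11.4b), 

$$B_\mu \to B'_\mu = B_\mu + (2/g_1)\,\partial_\mu\theta,$$

show that $\mathcal{L}_1$ is invariant if $L$ transforms as 

$$L \to L' = \exp[-i(g'/g_1)\theta]\,L.$$

Now set the fields $B_\mu$ to be zero, and consider 

$$\mathcal{L}_2 = L^\dagger\tilde{\sigma}^\mu i(\partial_\mu + i(g'/2)W_\mu)L.$$

With the gauge transformation (11.6), 

$$W_\mu \to W'_\mu = UW_\mu U^\dagger + (2i/g_2)(\partial_\mu U)U^\dagger,$$

show that $\mathcal{L}_2$ can be made invariant only if 

$$L \to L' = UL \quad\text{and}\quad g' = g_2.$$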

12.2 Show that, to conform with the mathematical structure of Chapter 11, if two 
fields are to be put together in an SU(2) doublet then they must differ by e 
in electric charge. 

12.3 Inspection of (12.9) shows that the Higgs boson can decay into an $e^+e^-$ pair. 
Show that, in the rest frame of the Higgs particle, the electron and positron must 
have equal and opposite momenta and the same helicity (i.e. both positive or both 
negative). 

Show that the final density of momentum states for the decay is 

where $p_e$ and $E_e$ are the momentum and energy of the electron. 

Calculate the matrix elements for the transition, and hence show that to lowest 
order in perturbation theory, 

$$\text{total decay rate} = \frac{c_e^2}{16\pi}\,m_H\left(\frac{v_e}{c}\right)^3,$$

where $v_e$ is the electron velocity. 


12.4 Show that the ratio of the leptonic partial width of the Higgs particle to its mass is 
approximately 

$$\frac{1}{16\pi}\,(c_e^2 + c_\mu^2 + c_\tau^2) \approx 2 \times 10^{-6}.$$


12.5 Verify that the unitary transformations of Section 12.6 preserve the form of the 
dynamical terms in the Lagrangian density. 





13 

Experimental tests of the Weinberg-Salam theory 


13.1 The search for the gauge bosons 


We saw in the preceding chapter that the low energy limit of the electroweak 
Weinberg-Salam theory reduces to the successful phenomenology of Chapter 9. 
There is no reason to doubt that the Weinberg-Salam theory describes all low energy 
$\beta$ decays, but it also describes very much more. The pathological cross-section of 
equation (9.14) is modified to 

$$\sigma(\nu_\mu e^- \to \mu^-\nu_e) = \frac{G_F^2}{\pi}\left(\frac{(s - m_\mu^2)^2}{s[1 + (s - m_\mu^2)/M_w^2]}\right). \tag{13.1}$$

At high energies, $s \gg M_w^2$, this expression tends to $G_F^2M_w^2/\pi = 1.08 \times 10^{-10}$ b. 
It is a renormalisable theory, so that quantum corrections can be calculated. At 
high energies these corrections become increasingly important (at the few per cent 
level). 
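
The high energy limit quoted above is a one-line calculation once a conversion from GeV$^{-2}$ to barns is supplied. The sketch below uses $(\hbar c)^2 \approx 0.3894$ GeV$^2$ mb, a standard conversion factor brought in here as an assumption, together with $G_F$ and $M_w$ as quoted in the text. The function `sigma_13_1` encodes one reading of (13.1), namely $(G_F^2/\pi)(s - m_\mu^2)^2 / \{s[1 + (s - m_\mu^2)/M_w^2]\}$, and the muon mass is likewise an assumed value.

```python
import math

G_F = 1.166e-5                 # GeV^-2, experimental value quoted in Section 12.2
M_W = 80.425                   # GeV, from (11.33)
M_MU = 0.10566                 # GeV, muon mass (assumed value)
GEV_M2_TO_BARN = 0.3894e-3     # (hbar c)^2 in GeV^2 barn (assumed conversion factor)

def sigma_13_1(s):
    """Cross-section in barns as a function of s (GeV^2), following the reading of (13.1) above."""
    numerator = (s - M_MU ** 2) ** 2
    denominator = s * (1.0 + (s - M_MU ** 2) / M_W ** 2)
    return G_F ** 2 / math.pi * numerator / denominator * GEV_M2_TO_BARN

# Asymptotic value G_F^2 M_w^2 / pi, converted to barns.
sigma_limit = G_F ** 2 * M_W ** 2 / math.pi * GEV_M2_TO_BARN

# Far above the W mass the cross-section approaches the limit from below.
sigma_high = sigma_13_1(1.0e10)
# Well below the W mass it grows roughly linearly with s.
growth_ratio = sigma_13_1(2.0) / sigma_13_1(1.0)
```

With these inputs `sigma_limit` is about $1.1 \times 10^{-10}$ b, in line with the figure quoted in the text; the small difference in the last digit depends on which value of $M_w$ is used.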

The clearest test of the theory is the observation of the conjectured gauge bosons, 
the $W^\pm$ and $Z$. These were discovered at CERN in 1983, using a specially 
constructed proton-antiproton collider, with a centre of mass energy of 540 GeV. It 
was very important for the successful identification of the new particles that their 
masses and decay characteristics had already been well estimated within the 
theory. The masses depend on $G_F$, $e$ and the Weinberg angle $\theta_w$ (equations (11.37) and 
(12.22)). The values of $G_F$ and $e$ were well established, and estimates of $\theta_w$ were 
available from careful observations of neutral current events. We saw in Section 
12.3 that the $e^-\nu_\mu \to e^-\nu_\mu$ and $e^-\bar{\nu}_\mu \to e^-\bar{\nu}_\mu$ cross-sections are sensitive to $\theta_w$. 
Similarly, the cross-sections for $\nu$ and $\bar{\nu}$ scattering from nuclei depend on $\theta_w$, as we 
shall see in more detail in Chapter 14. Since the centre of mass energy available 
in neutrino-nuclear scattering is much greater than in neutrino-electron scattering 
(equation (9.13)) and the cross-sections increase with energy, it was the neutral 
current experiments on nuclei which gave an estimate of $\theta_w$, and this estimate was 
in fact close to the presently accepted value. The experimental physicists knew 
what to look for! 

Figure 13.1 Quark-antiquark annihilation is the principal process contributing 
to W and Z production in proton-antiproton collisions at present day collider 
energies. 

The successful identification of the new particles also relied on estimates of the
likely production cross-sections of the particles. We have not yet discussed how
quarks interact with the $W^\pm$ and Z bosons, but we shall see in Chapter 14 that the
interactions are similar to the interactions of leptons with the gauge bosons. Two
of the processes that contribute to Z and $W^+$ production are sketched in Fig. 13.1.
The outgoing proton and antiproton remnants materialise as complicated jets of
particles moving in directions closely correlated with the original proton and antiproton
directions. It is a fortunate circumstance for identification that the decay products
of the gauge bosons are frequently well separated from the particles in the remnants
(Problem 13.1).

The quark-antiquark pair responsible for gauge boson production carry only a 
fraction of the original 540 GeV of energy, and the 540 GeV design parameter 
allowed for this effect. The important analysis of the partition of the energy of a 
beam particle between its constituents is discussed in Appendix D. 

13.2 The $W^\pm$ bosons

The results of these experiments at CERN and subsequent experiments dramatically
confirmed the theoretical expectations. The charged $W^\pm$ bosons have a mass

$$M_W = 80.425 \pm 0.038\ \text{GeV},$$

and their decay rates to lepton pairs are measured to be

$$\Gamma(W^+ \to e^+\nu_e) = 228 \pm 6\ \text{MeV},$$

$$\Gamma(W^+ \to \mu^+\nu_\mu) = 225 \pm 9\ \text{MeV},$$

$$\Gamma(W^+ \to \tau^+\nu_\tau) = 228 \pm 11\ \text{MeV},$$

and $\Gamma(W^+ \to e^+\nu_e) = \Gamma(W^- \to e^-\bar\nu_e)$, etc.

To lowest order in perturbation theory, and neglecting terms in $(m_{\text{lepton}}/M_W)^2$,
these partial widths are all equal in the Standard Model and

$$\Gamma(W^+ \to e^+\nu_e) = \frac{G_F M_W^3}{6\pi\sqrt{2}} = 226 \pm 1\ \text{MeV}, \tag{13.2}$$

(Problem 13.3) in good agreement with the experimental data.
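
The size of (13.2) can be checked numerically. The sketch below is only an illustration: the value of the Fermi constant is an assumed input (it is not quoted in this section), and the function name is hypothetical. With these inputs the result lies within about one per cent of the value quoted in (13.2); the last digits depend on the inputs chosen.

```python
import math

G_F = 1.16637e-5  # Fermi constant in GeV^-2 (assumed input, not quoted in this section)
M_W = 80.425      # W mass in GeV, as quoted in Section 13.2


def w_leptonic_width(g_f: float, m_w: float) -> float:
    """Lowest-order partial width of equation (13.2), in GeV."""
    return g_f * m_w**3 / (6.0 * math.pi * math.sqrt(2.0))


width_mev = 1000.0 * w_leptonic_width(G_F, M_W)
print(f"Gamma(W -> e nu) ~ {width_mev:.1f} MeV")
```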


13.3 The Z boson 

The experiments that revealed the charged $W^\pm$ bosons also revealed the neutral Z
boson, but the mass of the Z boson and its decay rates are now known far more
accurately than those of the $W^\pm$ bosons. In 1989, two $e^+e^-$ colliders were opened:
LEP at CERN and SLC at Stanford. In these machines, the electrons and positrons
have equal energies and opposite momenta, and the centre of mass energy can be
tuned to lie at and around the mass of the Z. Typical resonant cross-sections for
particle production are shown in Fig. 13.2, and corresponding Feynman diagrams
in Fig. 13.3. At the peak energy, Z bosons at rest are copiously produced by $e^+e^-$
annihilation. These very clean events have given precise data on the properties of
the Z. The mass of the Z is

$$M_Z = 91.1876 \pm 0.0021\ \text{GeV},$$

and partial decay widths to charged lepton-antilepton pairs are

$$\Gamma(Z \to e^+e^-) = 83.91 \pm 0.20\ \text{MeV},$$

$$\Gamma(Z \to \mu^+\mu^-) = 83.99 \pm 0.35\ \text{MeV},$$

$$\Gamma(Z \to \tau^+\tau^-) = 84.09 \pm 0.40\ \text{MeV}.$$

The total decay width, which includes decays to hadrons and the $\nu\bar\nu$ pairs, is
$\Gamma(\text{total}) = 2495 \pm 2$ MeV.

The theoretical partial widths for decay to charged lepton pairs depend on the
Weinberg angle $\theta_W$. To lowest order and neglecting terms in $(m_{\text{lepton}}/M_Z)^2$, the
partial widths are all equal and

$$\Gamma(Z \to e^+e^-) = \frac{G_F M_Z^3}{12\sqrt{2}\,\pi}\left[(1 - 2\sin^2\theta_W)^2 + 4\sin^4\theta_W\right]. \tag{13.3}$$

Taking the accepted value of $\sin^2\theta_W = 0.2312$, this gives, to lowest order,

$$\Gamma(Z \to e^+e^-) = 83.4\ \text{MeV}.$$

Again, there is remarkable agreement between theory and experiment.
Figure 13.2 ... of $E$, the initiating $e^+e^-$ centre of mass energy. The experimental data were
presented at the 25th International Conference on High Energy Physics in Singapore
in 1990 by the ALEPH collaboration of CERN. The curve is the prediction of the
Standard Model but with parameters such as the Z mass as variables determined
by the data (see Hansen (1991)).


13.4 The number of lepton families 

For the decay rates to neutrino-antineutrino pairs, the Standard Model gives

$$\Gamma(Z \to \nu_e\bar\nu_e) = \Gamma(Z \to \nu_\mu\bar\nu_\mu) = \Gamma(Z \to \nu_\tau\bar\nu_\tau) = \frac{G_F M_Z^3}{12\sqrt{2}\,\pi} = 165.9\ \text{MeV}. \tag{13.4}$$

Hence the partial width for decay to any neutrino-antineutrino pair is

$$3\,\Gamma(Z \to \nu_e\bar\nu_e) = 497.6\ \text{MeV}.$$

This can be compared with the partial width $\Gamma(\text{invisible})$ associated with $e^+e^-$ pairs
annihilating without trace, since neutrinos and antineutrinos are the only particles
that will escape unseen by the particle detectors.

Figure 13.3 The basic Feynman graphs that describe the processes of Fig. 13.2.
The fitting curve includes additional graphs that give the Z resonance its width and
graphs that describe accompanying electromagnetic processes.

Experimentally, it is found that

$$\Gamma(\text{invisible}) = 498.3 \pm 4.2\ \text{MeV}.$$

The agreement with the Standard Model value is a striking confirmation of the
theory. It implies that there are no more light neutrino types and rules out there
being any more 'standard' lepton doublets in Nature than the three already known.
This is a result of fundamental significance.
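
The counting argument above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming a value for the Fermi constant that is not quoted in this section; the function names are hypothetical. It evaluates the lowest-order widths (13.3) and (13.4) and divides the measured invisible width by the width per neutrino type.

```python
import math

G_F = 1.16637e-5           # Fermi constant in GeV^-2 (assumed input)
M_Z = 91.1876              # Z mass in GeV (Section 13.3)
SIN2_THETA_W = 0.2312      # value used with (13.3)
GAMMA_INVISIBLE_MEV = 498.3  # measured invisible width (Section 13.4)


def z_to_neutrino_pair(g_f: float, m_z: float) -> float:
    """Equation (13.4): width to one neutrino-antineutrino pair, in GeV."""
    return g_f * m_z**3 / (12.0 * math.sqrt(2.0) * math.pi)


def z_to_charged_lepton_pair(g_f: float, m_z: float, sin2: float) -> float:
    """Equation (13.3): width to one charged lepton pair, in GeV."""
    return z_to_neutrino_pair(g_f, m_z) * ((1.0 - 2.0 * sin2) ** 2 + 4.0 * sin2**2)


gamma_nu_mev = 1000.0 * z_to_neutrino_pair(G_F, M_Z)
gamma_ee_mev = 1000.0 * z_to_charged_lepton_pair(G_F, M_Z, SIN2_THETA_W)
n_families = GAMMA_INVISIBLE_MEV / gamma_nu_mev

print(f"Gamma(Z -> nu nubar) ~ {gamma_nu_mev:.1f} MeV")
print(f"Gamma(Z -> e+ e-)    ~ {gamma_ee_mev:.1f} MeV")
print(f"number of light neutrino types ~ {n_families:.2f}")
```

The ratio comes out very close to three; the quoted experimental uncertainty of 4.2 MeV corresponds to an uncertainty of only a few per cent of one neutrino type.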


13.5 The measurement of partial widths 

In view of the importance of the partial widths for Z decay, we shall sketch how
they are obtained from the experimental results. The cross-section for $e^+e^-$ elastic
scattering at small angles is dominated by photon exchange, even around the
Z resonance, and is well known from QED. This small angle elastic scattering
of the beam particles is constantly monitored during data taking, and the
cross-section for any other process, for example $e^+e^- \to \mu^+\mu^-$, is then obtained from
the measured rate of $\mu^+\mu^-$ production relative to the rate of $e^+e^-$ small angle
scattering. This, essentially, is how the graphs of Fig. 13.2 are arrived at. We
give now a much simplified analysis that indicates how the partial widths are
extracted.

Assume that the cross-sections are described by a simple Breit-Wigner formula.
For example,

$$\sigma(e^+e^- \to \mu^+\mu^-) = \frac{3\pi}{M_Z^2}\,\frac{\Gamma_{ee}\Gamma_{\mu\mu}}{(E - M_Z)^2 + \Gamma^2/4}, \tag{13.5}$$

$$\sigma(e^+e^- \to \text{hadrons}) = \frac{3\pi}{M_Z^2}\,\frac{\Gamma_{ee}\Gamma_{\text{had}}}{(E - M_Z)^2 + \Gamma^2/4}. \tag{13.6}$$

(The factor 3 is a spin factor.)

$M_Z$ and the total decay width $\Gamma$ can be found from the position and width of the
experimental peak. Then, taking $\Gamma_{ee} = \Gamma_{\mu\mu}$, the ratio $\Gamma_{ee}/\Gamma$ can be found from the
peak of the cross-section $\sigma(e^+e^- \to \mu^+\mu^-)$ at $E = M_Z$, using (13.5):

$$\left(\frac{\Gamma_{ee}}{\Gamma}\right)^2 = \frac{M_Z^2}{12\pi}\,\sigma(e^+e^- \to \mu^+\mu^-\ \text{at}\ E = M_Z).$$

Using this result, the ratio $\Gamma_{\text{had}}/\Gamma$ follows from the peak of the cross-section
$\sigma(e^+e^- \to \text{hadrons})$. From (13.6),

$$\frac{\Gamma_{\text{had}}}{\Gamma} = \frac{M_Z^2\,\Gamma}{12\pi\,\Gamma_{ee}}\,\sigma(e^+e^- \to \text{hadrons at}\ E = M_Z).$$

To obtain $\Gamma(\text{invisible})$, we take

$$\Gamma(\text{invisible}) = \Gamma - 3\Gamma_{ee} - \Gamma_{\text{had}}.$$


In reality the data have to be treated very much more carefully than is implied 
above. In particular electromagnetic effects during the collision process distort the 
simple Breit-Wigner shape, and appropriate corrections are applied in the actual 
analysis. 
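
The simplified extraction described above can be written out as a short closure test. The sketch below is illustrative only: it generates peak cross-sections from the simple Breit-Wigner form with assumed input widths (central values quoted in Chapters 13 and 15), then inverts them with the two peak relations. The function names and the unit-conversion constant are assumptions introduced for the example, and none of the corrections mentioned above are included, so this is not a reanalysis of data.

```python
import math

GEV2_TO_NB = 0.3894e6  # (hbar c)^2 in nb GeV^2, used only for display


def breit_wigner(energy, m_z, gamma_total, gamma_in, gamma_out):
    """Simple Breit-Wigner cross-section of (13.5)/(13.6), in GeV^-2."""
    return (3.0 * math.pi / m_z**2) * gamma_in * gamma_out / (
        (energy - m_z) ** 2 + gamma_total**2 / 4.0)


def extract_widths(m_z, gamma_total, sigma_mumu_peak, sigma_had_peak):
    """Invert the two peak cross-sections, assuming Gamma_ee = Gamma_mumu."""
    gamma_ee = gamma_total * math.sqrt(m_z**2 * sigma_mumu_peak / (12.0 * math.pi))
    gamma_had = (m_z**2 * gamma_total**2 * sigma_had_peak
                 / (12.0 * math.pi * gamma_ee))
    gamma_inv = gamma_total - 3.0 * gamma_ee - gamma_had
    return gamma_ee, gamma_had, gamma_inv


# Illustrative inputs in GeV.
M_Z, GAMMA, GAMMA_EE, GAMMA_HAD = 91.1876, 2.495, 0.08391, 1.741

# Pseudo-data: peak cross-sections generated from the same formula.
sigma_mumu = breit_wigner(M_Z, M_Z, GAMMA, GAMMA_EE, GAMMA_EE)
sigma_had = breit_wigner(M_Z, M_Z, GAMMA, GAMMA_EE, GAMMA_HAD)

gamma_ee, gamma_had, gamma_inv = extract_widths(M_Z, GAMMA, sigma_mumu, sigma_had)
print(f"peak sigma(hadrons) ~ {sigma_had * GEV2_TO_NB:.1f} nb")
print(f"Gamma_ee = {1e3 * gamma_ee:.2f} MeV, Gamma_had = {1e3 * gamma_had:.1f} MeV, "
      f"Gamma_inv = {1e3 * gamma_inv:.1f} MeV")
```

Because the pseudo-data are generated from the same formula, the inversion returns the input widths exactly; with real data the electromagnetic distortions of the line shape would have to be unfolded first.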

Figure 13.4 shows the result of such a more sophisticated fit, compared with
Standard Model predictions assuming two, three and four types of massless neutrinos.
The data unequivocally require three.


13.6 Left-right production cross-section asymmetry and lepton decay 
asymmetry of the Z boson 

Other details of the Weinberg-Salam theory can be tested with $e^+e^-$ colliders. Much
work has been done at Stanford with the SLC beam energies tuned to the Z boson
mass. The beam intensities at SLC were lower than those at the CERN collider,
but the SLC had an advantage in that the electron beam can be polarised along
the beam direction so that the relative proportions of positive and negative helicity
electrons can be changed.

Figure 13.4 The cross-section $\sigma(e^+e^- \to \text{hadrons})$ as a function of $E$, the
initiating $e^+e^-$ centre of mass energy. The experimental data were presented at the
25th International Conference on High Energy Physics in Singapore in 1990 by the
OPAL collaboration of CERN. The data are compared with the predictions of the
Standard Model but with two, three and four neutrino types. Three light neutrino
types are clearly favoured (see Mori (1991)).

Figure 13.5 The differential cross-section $d\sigma(e^+e^- \to \mu^+\mu^-)/d\cos\theta$. The data
were taken at DESY at an $e^+e^-$ centre of mass energy of 30 GeV. The dashed line
is the prediction of quantum electrodynamics alone, the full line fits the data
and shows the modification due to the presence of the Z boson which gives this
interference effect (R. Marshall, Rutherford Appleton Laboratory Report RAL
89-021).

We have seen in Chapter 7 that, at high energies, negative
helicity electrons and positive helicity positrons are associated with left-handed
fields, positive helicity electrons and negative helicity positrons are associated with
right-handed fields. It follows from the form of the interaction term (12.33) in the
Weinberg-Salam Lagrangian that in interacting with an unpolarised positron beam
(equal numbers of positive helicity and negative helicity positrons) the cross-section
$\sigma_L$ for Z production by a negative helicity electron is proportional to $(\cos 2\theta_W)^2$ and
the cross-section $\sigma_R$ for Z production by a positive helicity electron is proportional
to $(2\sin^2\theta_W)^2$. The constants of proportionality are the same so that the left-right
cross-section asymmetry is, to lowest order,

$$A_{LR} = \frac{\sigma_L - \sigma_R}{\sigma_L + \sigma_R} = \frac{(\cos 2\theta_W)^2 - (2\sin^2\theta_W)^2}{(\cos 2\theta_W)^2 + (2\sin^2\theta_W)^2} = \frac{2(1 - 4\sin^2\theta_W)}{1 + (1 - 4\sin^2\theta_W)^2}.$$

From the measurements at SLC (Fero, 1994) it is calculated that $A_{LR} = 0.1628 \pm 0.0099$,
which gives an estimate

$$\sin^2\theta_W = 0.2292 \pm 0.0013.$$

This estimate does not depend on the ratio $M_W/M_Z$, since the $W^\pm$ bosons are not
involved.
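
The lowest-order relation between $A_{LR}$ and $\sin^2\theta_W$ is easily inverted. The sketch below (function names are hypothetical) solves the quadratic in $x = 1 - 4\sin^2\theta_W$ and applies it to the SLC central value. At lowest order the inversion gives about 0.2295, which lies within the quoted uncertainty of the published estimate; the small difference is presumably due to corrections beyond this lowest-order formula, which are not modelled here.

```python
import math


def a_lr(sin2_theta_w: float) -> float:
    """Lowest-order left-right asymmetry as a function of sin^2(theta_W)."""
    x = 1.0 - 4.0 * sin2_theta_w
    return 2.0 * x / (1.0 + x * x)


def sin2_from_a_lr(asym: float) -> float:
    """Invert A_LR = 2x / (1 + x^2), taking the root with |x| < 1."""
    if asym == 0.0:
        return 0.25
    x = (1.0 - math.sqrt(1.0 - asym * asym)) / asym
    return (1.0 - x) / 4.0


A_LR_MEASURED = 0.1628
A_LR_ERROR = 0.0099

central = sin2_from_a_lr(A_LR_MEASURED)
spread = abs(sin2_from_a_lr(A_LR_MEASURED + A_LR_ERROR) - central)
print(f"sin^2(theta_W) ~ {central:.4f} +/- {spread:.4f}")
```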

At CERN and at a previous $e^+e^-$ collider at DESY in Hamburg the electron
beams had no longitudinal polarisation. Nevertheless if a Z boson is formed its spin
is aligned with the direction of the electron beam with probability proportional to
$(2\sin^2\theta_W)^2$, and anti-aligned with probability proportional to $(\cos 2\theta_W)^2$, giving it
a mean polarisation in the direction of the beam of $-A_{LR}$.

When the Z decays to a lepton-antilepton pair, the direction of the lepton is
correlated with the direction of the Z spin. The polarisation of the Z therefore gives
a forward-backward asymmetry in the angular distribution of the leptons.

The competing process of lepton production through the electromagnetic
interaction does give a symmetrical angular distribution. The observed asymmetry depends
on the interference between Z and $\gamma$ processes, and is energy dependent. Figure 13.5
shows the angular distribution of leptons with respect to the electron beam
direction at a centre of mass energy $E = 30$ GeV (which is below $M_Z$). These data were
taken at DESY and gave an estimate of $\sin^2\theta_W = 0.212 \pm 0.014$. This is another
impressive confirmation of the overall consistency of the Weinberg-Salam theory.


Problems 

13.1 W bosons are produced when a beam of high energy protons is in head-on
collision with a beam of antiprotons. The W boson momenta are strongly aligned
with the beams. The transverse component of momentum given to the W is small.
Neglecting this component, and assuming that in the W rest frame there is an
isotropic distribution of decay products, show that in a decay to a charged lepton
and a neutrino, the root mean square transverse lepton momentum is approximately
$M_W/\sqrt{6} \approx 33$ GeV.

Events with large transverse momenta are rare, and their observation allows W
production to be identified. (Note that the transverse momenta are unchanged by a
Lorentz boost of the W in the beam direction.)

13.2 From the interaction term in (12.23) of the Z boson with an electron-positron pair,
show that in head-on unpolarised $e^+e^-$ collisions, the probability of the Z boson
spin being aligned with the electron beam is proportional to $(2\sin^2\theta_W)^2$, and of
being antialigned is proportional to $(\cos 2\theta_W)^2$.

13.3 Neglecting lepton mass terms, obtain the partial widths (13.2), (13.3) and (13.4).

13.4 Recalculate (13.3), taking $\cos\theta_W = M_W/M_Z$.
The electromagnetic and weak interactions of quarks


In the Standard Model it is the quarks’ colour that is the source of their strong 
interaction. In this chapter we shall consider only the electromagnetic and weak 
interactions of quarks, and colour will not enter. The theory will be constructed in 
close analogy with the electroweak theory for leptons set out in Chapter 12. The 
theory for quarks is not as well founded in experiment as the theory for leptons. 
This is because quarks cannot be isolated from hadrons. Experiments can only 
be performed on composite quark systems, and the basic Lagrangian density is 
obscured at low energies by the strong interactions. At higher energies, and
especially through the hadronic decays of the Z bosons, the electroweak physics of the
isolated quarks can to some extent be discerned. In Chapter 15 some of the relevant 
experimental data on these decays will be described. 


14.1 Construction of the Lagrangian density 

At low energies, the model has to describe decays like 


$$n \to p + e^- + \bar\nu_e$$

or, at quark level,

$$d \to u + e^- + \bar\nu_e.$$

This decay is mediated by the W boson. Comparing it with muon decay,

$$\mu^- \to \nu_\mu + e^- + \bar\nu_e,$$

which is also mediated by the W boson, suggests that the left-handed components
$u_L$ and $d_L$ of the quark fields should be put together in an SU(2) doublet,

$$L = \begin{pmatrix} u_L \\ d_L \end{pmatrix}, \tag{14.1}$$
while $u_R$ and $d_R$ are, like $\nu_R$ and $e_R$, unchanged by SU(2) transformations. We shall
see that this simple assignment would be correct if Nature had provided us with
only one type of up quark, and only one type of down quark.

With such an assignment there is no freedom in the construction of the weak
interaction. There is only one way to make the dynamical part of the quark Lagrangian
density gauge invariant. The coupling to the field $\mathbf{W}_\mu$ is uniquely determined by
SU(2) symmetry and the coupling to the field $B_\mu$ is fixed by the quark electric
charges: $2e/3$ on the u quark, $-e/3$ on the d quark. Hence

$$\mathcal{L}_{\text{dyn}} = L^\dagger\tilde\sigma^\mu\, i\left[\partial_\mu + (ig_2/2)\mathbf{W}_\mu + (ig_1/6)B_\mu\right]L
+ u_R^\dagger\sigma^\mu\, i\left[\partial_\mu + (2ig_1/3)B_\mu\right]u_R
+ d_R^\dagger\sigma^\mu\, i\left[\partial_\mu - (ig_1/3)B_\mu\right]d_R, \tag{14.2}$$

where $g_2\sin\theta_W = g_1\cos\theta_W = e$.

To conform with the transformation laws (11.4b) and (11.6) on the gauge fields,
the U(1) $\times$ SU(2) transformation of the quark fields must be

$$L \to L' = e^{-i\theta(x)/3}\,\mathbf{U}L,$$

$$u_R \to u_R' = e^{-4i\theta(x)/3}\,u_R,$$

$$d_R \to d_R' = e^{2i\theta(x)/3}\,d_R. \tag{14.3}$$


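Using (11.17) and (11.29), $\mathcal{L}_{\text{dyn}}$ can be written in terms of the fields $W^\pm_\mu$, $Z_\mu$ and
$A_\mu$ and becomes

$$\mathcal{L}_{\text{dyn}} = L^\dagger\tilde\sigma^\mu\, i
\begin{pmatrix}
\partial_\mu + \dfrac{2ie}{3}A_\mu + \dfrac{ie}{3\sin 2\theta_W}(1 + 2\cos 2\theta_W)Z_\mu & \dfrac{ie}{\sqrt{2}\sin\theta_W}W^+_\mu \\[2ex]
\dfrac{ie}{\sqrt{2}\sin\theta_W}W^-_\mu & \partial_\mu - \dfrac{ie}{3}A_\mu - \dfrac{ie}{3\sin 2\theta_W}(2 + \cos 2\theta_W)Z_\mu
\end{pmatrix} L$$

$$\qquad + u_R^\dagger\sigma^\mu\, i\left[\partial_\mu + \frac{2ie}{3}A_\mu - \frac{2ie}{3}\tan\theta_W\, Z_\mu\right]u_R
+ d_R^\dagger\sigma^\mu\, i\left[\partial_\mu - \frac{ie}{3}A_\mu + \frac{ie}{3}\tan\theta_W\, Z_\mu\right]d_R. \tag{14.4}$$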
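
As a cross-check on the Z couplings in (14.4), the coefficients multiplying $(e/\sin 2\theta_W)Z_\mu$ can be compared with the familiar weak-isospin form $-2(T_3 - Q\sin^2\theta_W)$. That general formula is not stated in the text and is used here as an assumption; the function names are hypothetical, and the value of $\sin^2\theta_W$ is only an illustrative input.

```python
import math


def z_coefficients_from_14_4(sin2: float) -> dict:
    """Coefficients of (e / sin 2theta_W) Z_mu read off from (14.4).

    Each entry is i times the Z term inside the square brackets of (14.4),
    divided by e / sin(2 theta_W).
    """
    theta = math.asin(math.sqrt(sin2))
    cos_2t = math.cos(2.0 * theta)
    tan_sin2t = math.tan(theta) * math.sin(2.0 * theta)  # equals 2 sin^2(theta)
    return {
        "u_L": -(1.0 + 2.0 * cos_2t) / 3.0,
        "d_L": +(2.0 + cos_2t) / 3.0,
        "u_R": +(2.0 / 3.0) * tan_sin2t,
        "d_R": -(1.0 / 3.0) * tan_sin2t,
    }


def z_coefficients_isospin(sin2: float) -> dict:
    """Assumed general form -2 (T3 - Q sin^2 theta_W) for each chiral field."""
    quantum_numbers = {          # (T3, Q in units of e)
        "u_L": (+0.5, +2.0 / 3.0),
        "d_L": (-0.5, -1.0 / 3.0),
        "u_R": (0.0, +2.0 / 3.0),
        "d_R": (0.0, -1.0 / 3.0),
    }
    return {name: -2.0 * (t3 - q * sin2) for name, (t3, q) in quantum_numbers.items()}


SIN2 = 0.2312  # illustrative value
from_lagrangian = z_coefficients_from_14_4(SIN2)
from_isospin = z_coefficients_isospin(SIN2)
for name in from_lagrangian:
    print(name, round(from_lagrangian[name], 6), round(from_isospin[name], 6))
```

The two columns agree for any value of $\sin^2\theta_W$, which is a useful check on the signs in the reconstructed matrix.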


However, the Standard Model postulates three families, or generations, of quarks.
We therefore introduce three left-handed SU(2) doublets:

$$\begin{pmatrix} u_{L1} \\ d_{L1} \end{pmatrix},\quad
\begin{pmatrix} u_{L2} \\ d_{L2} \end{pmatrix},\quad
\begin{pmatrix} u_{L3} \\ d_{L3} \end{pmatrix},$$

and six right-handed singlets: $u_{R1}, d_{R1}$; $u_{R2}, d_{R2}$; $u_{R3}, d_{R3}$. For a more compact
notation we shall denote these by

$$L_k = \begin{pmatrix} u_{Lk} \\ d_{Lk} \end{pmatrix},\quad u_{Rk},\ d_{Rk}\quad \text{with } k = 1, 2, 3.$$

As in the lepton case, we take the dynamical part of the total quark Lagrangian as
a sum:

$$\mathcal{L}_{\text{dyn}}(\text{quark}) = \sum_{k=1}^{3}\mathcal{L}_{\text{dyn}}(u_k, d_k). \tag{14.5}$$


14.2 Quark masses and the Kobayashi-Maskawa mixing matrix 

To retain renormalisability we must retain gauge symmetry, and give mass to the
quarks by coupling to the Higgs field as in Chapter 12 where we gave mass to the
leptons. For the $d_k$ quarks this is straightforward. The most general form we might
consider that preserves the gauge symmetries is

$$\mathcal{L}_{\text{Higgs}}(d) = -\sum_{i,j}\left[G^d_{ij}(L_i^\dagger\Phi)d_{Rj} + G^{d*}_{ij}\,d_{Rj}^\dagger(\Phi^\dagger L_i)\right], \tag{14.6}$$

as we discussed in the lepton case in Section 12.6. After the symmetry breaking of
the Higgs field $\Phi$, this gives the mass term for the d-type quarks:

$$\mathcal{L}_{\text{mass}}(d) = -\phi_0\sum_{i,j}\left[G^d_{ij}\,d_{Li}^\dagger d_{Rj} + G^{d*}_{ij}\,d_{Rj}^\dagger d_{Li}\right]. \tag{14.7}$$

A priori, $G^d_{ij}$ is an arbitrary $3 \times 3$ complex matrix. As we remarked in Section
12.6, such a matrix can always be put into real diagonal form with the help of two
unitary matrices, so that we can write

$$\phi_0 G^d = D_L^\dagger m^d D_R,$$

where $m^d$ is a real diagonal matrix, and $D_L$, $D_R$ are unitary matrices. If the diagonal
elements are distinct, as appears experimentally to be the case, $D_L$, $D_R$ are unique,
except that both may be multiplied on the left by the same phase-factor matrix

$$\begin{pmatrix} e^{i\alpha_1} & 0 & 0 \\ 0 & e^{i\alpha_2} & 0 \\ 0 & 0 & e^{i\alpha_3} \end{pmatrix}. \tag{14.8}$$

In the Standard Model as set out in Chapter 12, the neutrinos were taken to have
zero mass. However, for the u-type quarks, which are here making up a left-handed
doublet, we need a mass term. For this purpose we introduce the $2 \times 2$ matrix in
SU(2) space

$$\varepsilon = \begin{pmatrix} \varepsilon_{AA} & \varepsilon_{AB} \\ \varepsilon_{BA} & \varepsilon_{BB} \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}.$$

A suitable SU(2) invariant expression which we can construct from the doublets $\Phi$
and $L_i$ is $(\Phi^T\varepsilon L_i)$, where $\Phi^T = (\Phi_A, \Phi_B)$ is the transpose of $\Phi$ (Problem 14.3).
We then take

$$\mathcal{L}_{\text{Higgs}}(u) = -\sum_{i,j}\left[G^u_{ij}(L_i^\dagger\varepsilon\Phi^*)u_{Rj} - G^{u*}_{ij}\,u_{Rj}^\dagger(\Phi^T\varepsilon L_i)\right], \tag{14.9}$$

where $G^u_{ij}$ is another complex $3 \times 3$ matrix. On symmetry breaking, this gives the
u-quarks mass term

$$\mathcal{L}_{\text{mass}}(u) = -\phi_0\sum_{i,j}\left[G^u_{ij}\,u_{Li}^\dagger u_{Rj} + G^{u*}_{ij}\,u_{Rj}^\dagger u_{Li}\right], \tag{14.10}$$

which is, as we might expect, similar to (14.7), and likewise preserves the gauge
symmetries. It can be brought into real diagonal form in a similar way:

$$\phi_0 G^u = U_L^\dagger m^u U_R,$$

where $U_L$ and $U_R$ are unitary matrices, and $m^u$ is diagonal.

$U_L$ and $U_R$ may be both multiplied on the left by a phase factor matrix, say

$$\begin{pmatrix} e^{i\beta_1} & 0 & 0 \\ 0 & e^{i\beta_2} & 0 \\ 0 & 0 & e^{i\beta_3} \end{pmatrix}.$$


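The theory is most directly described in terms of the 'true' quark fields, for
which the mass matrices are diagonal, so that we define the six quark fields:

$$d'_{Li} = D_{Lij}\,d_{Lj},\qquad d'_{Ri} = D_{Rij}\,d_{Rj},$$
$$u'_{Li} = U_{Lij}\,u_{Lj},\qquad u'_{Ri} = U_{Rij}\,u_{Rj}. \tag{14.11}$$

The quark mass contribution to $\mathcal{L}$ becomes:

$$\mathcal{L}_{\text{mass}}(\text{quarks}) = -\sum_{i=1}^{3}\left[m^d_i\left(d'^\dagger_{Li}d'_{Ri} + d'^\dagger_{Ri}d'_{Li}\right) + m^u_i\left(u'^\dagger_{Li}u'_{Ri} + u'^\dagger_{Ri}u'_{Li}\right)\right]. \tag{14.12a}$$

We identify the Dirac spinors

$$\begin{pmatrix} u'_{L1} \\ u'_{R1} \end{pmatrix},\quad
\begin{pmatrix} u'_{L2} \\ u'_{R2} \end{pmatrix},\quad
\begin{pmatrix} u'_{L3} \\ u'_{R3} \end{pmatrix}$$

with the u, c and t quarks, respectively, and the Dirac spinors

$$\begin{pmatrix} d'_{L1} \\ d'_{R1} \end{pmatrix},\quad
\begin{pmatrix} d'_{L2} \\ d'_{R2} \end{pmatrix},\quad
\begin{pmatrix} d'_{L3} \\ d'_{R3} \end{pmatrix}$$

with the d, s and b quarks, so that we might rewrite (14.12a) as

$$\mathcal{L}_{\text{mass}}(\text{quarks}) = -\left[m_d\left(d_L^\dagger d_R + d_R^\dagger d_L\right) + m_u\left(u_L^\dagger u_R + u_R^\dagger u_L\right)\right]$$
$$\qquad - \left[m_s\left(s_L^\dagger s_R + s_R^\dagger s_L\right) + m_c\left(c_L^\dagger c_R + c_R^\dagger c_L\right)\right]$$
$$\qquad - \left[m_b\left(b_L^\dagger b_R + b_R^\dagger b_L\right) + m_t\left(t_L^\dagger t_R + t_R^\dagger t_L\right)\right]. \tag{14.12b}$$

The terms in (14.12b) correspond to six Dirac fermions.

We have dropped the primes, and for the remainder of the book $u_k$ and $d_k$, for
$k = 1, 2, 3$, will denote true quark fields.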

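In the $\mathcal{L}_{\text{dyn}}$ given by (14.2) and (14.5), the 'diagonal' terms do not mix u-type and
d-type quarks and are invariant under the unitary transformations (14.11). However,
the terms that arise from the off-diagonal elements of the matrix $\mathbf{W}_\mu$ mix u and d
quarks through their coupling to the $W^\pm$ boson fields, and these terms are profoundly
changed.

The diagonal terms give $\mathcal{L}_{q\text{Dirac}}$ and $\mathcal{L}_{qZ}$ that parallel the expressions (12.12) and
(12.23) of the lepton theory of Chapter 12. The complete electroweak Lagrangian
density for the quarks is

$$\mathcal{L}_q = \mathcal{L}_{q\text{Dirac}} + \mathcal{L}_{qZ} + \mathcal{L}_{qW} + \mathcal{L}_{qH},$$

where

$$\mathcal{L}_{q\text{Dirac}} = \sum_i\left[u_{Li}^\dagger\tilde\sigma^\mu\, i\{\partial_\mu + i(2e/3)A_\mu\}u_{Li} + u_{Ri}^\dagger\sigma^\mu\, i\{\partial_\mu + i(2e/3)A_\mu\}u_{Ri}\right]$$
$$\qquad + \sum_i\left[d_{Li}^\dagger\tilde\sigma^\mu\, i\{\partial_\mu - i(e/3)A_\mu\}d_{Li} + d_{Ri}^\dagger\sigma^\mu\, i\{\partial_\mu - i(e/3)A_\mu\}d_{Ri}\right] + \mathcal{L}_{q\text{mass}}, \tag{13}$$

$$\mathcal{L}_{qZ} = \sum_i\Big[-u_{Li}^\dagger\tilde\sigma^\mu u_{Li}\,\frac{e}{\sin(2\theta_W)}\,Z_\mu\left(1 - (4/3)\sin^2\theta_W\right)
+ u_{Ri}^\dagger\sigma^\mu u_{Ri}\,\frac{e}{\sin(2\theta_W)}\,Z_\mu\,\frac{4}{3}\sin^2\theta_W$$
$$\qquad + d_{Li}^\dagger\tilde\sigma^\mu d_{Li}\,\frac{e}{\sin(2\theta_W)}\,Z_\mu\left(1 - (2/3)\sin^2\theta_W\right)
- d_{Ri}^\dagger\sigma^\mu d_{Ri}\,\frac{e}{\sin(2\theta_W)}\,Z_\mu\,\frac{2}{3}\sin^2\theta_W\Big]. \tag{14.14}$$

In the $\mathcal{L}_{qW}$ part of the Lagrangian density, the terms

$$-\frac{e}{\sqrt{2}\sin\theta_W}\sum_i\left[u_{Li}^\dagger\tilde\sigma^\mu d_{Li}W^+_\mu + d_{Li}^\dagger\tilde\sigma^\mu u_{Li}W^-_\mu\right],$$

when written in terms of the 'true' quark fields given by (14.11), become

$$\mathcal{L}_{qW} = -\frac{e}{\sqrt{2}\sin\theta_W}\,
(u_L^\dagger, c_L^\dagger, t_L^\dagger)\,\tilde\sigma^\mu
\begin{pmatrix} V_{ud} & V_{us} & V_{ub} \\ V_{cd} & V_{cs} & V_{cb} \\ V_{td} & V_{ts} & V_{tb} \end{pmatrix}
\begin{pmatrix} d_L \\ s_L \\ b_L \end{pmatrix} W^+_\mu
+ \text{Hermitian conjugate}, \tag{14.15}$$

where $V = U_L D_L^\dagger$.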

Since the product of two unitary matrices is unitary, V is a $3 \times 3$ unitary
matrix. The elements of V are not determined within the theory. It is in this
matrix that another four of the parameters of the Standard Model reside. An
$n \times n$ unitary matrix is specified by $n^2$ parameters (Appendix A), so we
apparently have nine parameters to be measured experimentally. However, five of these
can be absorbed into the non-physical phases of the quark fields, through the
phase-factor matrices associated with $D_L$ (see (14.8)) and $U_L$. (There are five, rather than
six, non-physical phases since only phase differences appear in V. For example
$V_{ud} = \exp[i(\beta_u - \alpha_d)]\,V^0_{ud}$.)

When the quark phase factors have been extracted, the resulting matrix $V^0$ is
dependent on four physical parameters. It is called the Kobayashi-Maskawa (KM)
matrix (Kobayashi and Maskawa, 1973).
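
The counting in the last two paragraphs generalises to $n$ families. The short sketch below (the function name is hypothetical) subtracts the $2n - 1$ removable phase differences from the $n^2$ parameters of a unitary matrix and splits the remainder into rotation angles, of which an $n \times n$ real rotation has $n(n-1)/2$, and physical phases.

```python
def mixing_parameters(n_families: int) -> tuple:
    """Return (rotation angles, physical phases) of an n x n quark mixing matrix."""
    total = n_families**2                  # parameters of a general unitary matrix
    removable_phases = 2 * n_families - 1  # only phase differences enter V
    physical = total - removable_phases    # equals (n - 1)^2
    angles = n_families * (n_families - 1) // 2
    phases = physical - angles
    return angles, phases


for n in (2, 3, 4):
    angles, phases = mixing_parameters(n)
    print(f"{n} families: {angles} angles, {phases} phases")
```

For three families this gives the three angles and one phase of the text; for two families no physical phase survives, so at least three families are needed for a complex mixing matrix.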


14.3 The parameterisation of the KM matrix 

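A $3 \times 3$ rotation matrix is also a unitary matrix. A more general unitary matrix
can be constructed as a product of rotation matrices and unitary matrices made up
of phase factors. There is no unique parameterisation of the KM matrix by this
method. That advocated by the Particle Data Group is

$$V = \begin{pmatrix} 1 & 0 & 0 \\ 0 & c_{23} & s_{23} \\ 0 & -s_{23} & c_{23} \end{pmatrix}
\begin{pmatrix} e^{-i\delta/2} & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & e^{i\delta/2} \end{pmatrix}
\begin{pmatrix} c_{13} & 0 & s_{13} \\ 0 & 1 & 0 \\ -s_{13} & 0 & c_{13} \end{pmatrix}
\begin{pmatrix} e^{i\delta/2} & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & e^{-i\delta/2} \end{pmatrix}
\begin{pmatrix} c_{12} & s_{12} & 0 \\ -s_{12} & c_{12} & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

$$\quad = \begin{pmatrix}
c_{12}c_{13} & s_{12}c_{13} & s_{13}e^{-i\delta} \\
-s_{12}c_{23} - c_{12}s_{23}s_{13}e^{i\delta} & c_{12}c_{23} - s_{12}s_{23}s_{13}e^{i\delta} & s_{23}c_{13} \\
s_{12}s_{23} - c_{12}c_{23}s_{13}e^{i\delta} & -c_{12}s_{23} - s_{12}c_{23}s_{13}e^{i\delta} & c_{23}c_{13}
\end{pmatrix}, \tag{14.16}$$

where $c_{ij} = \cos\theta_{ij}$, $s_{ij} = \sin\theta_{ij}$. The four parameters are the three rotation angles
$\theta_{12}$, $\theta_{23}$, $\theta_{13}$, and the phase $\delta$.

Evidently, if $s_{13} = 0$ or $\sin\delta = 0$ then V is real. Less evidently, if $s_{12} = 0$ then
V is made real by redefining the quark fields

$$e^{i\delta}u_1 \to u_1,\qquad e^{i\delta}d_1 \to d_1,$$

and if $s_{23} = 0$ then V is made real by redefining

$$e^{-i\delta}u_3 \to u_3,\qquad e^{-i\delta}d_3 \to d_3,$$

as the reader may verify.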
A general redefinition of the quark phases,



will change the matrix elements of V by 



(14.17) 


Using this freedom, the three rotation angles can be chosen all to lie in the first 
quadrant. 

Jarlskog (1985) gives an important necessary and sufficient condition for
determining whether, given a unitary matrix V, it is possible to make it real by such
changes. She considers the imaginary part of any one of the nine products,
$V_{ij}V_{kl}V^*_{il}V^*_{kj}$, with $i \neq k$ and $j \neq l$, for example

$$\text{Im}\left(V_{11}V_{22}V^*_{12}V^*_{21}\right) = J\ \text{say}. \tag{14.18}$$

J is invariant under a general phase change (14.17), so that if J is not zero then it
cannot be made so, and hence V cannot be made real. All nine quantities are equal
to $\pm J$. In the parameterisation of equation (14.16),

$$J = c_{12}\,c_{13}^2\,c_{23}\,s_{12}\,s_{13}\,s_{23}\sin\delta. \tag{14.19}$$

(The conditions already obtained for the reality of the KM matrix are contained in
the condition $J = 0$.)
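
These statements can be checked numerically. The sketch below builds V as the product of the five factors in (14.16), confirms that it is unitary, evaluates J from (14.18), compares it with the closed form (14.19), and verifies that J is unchanged when rows and columns of V are multiplied by arbitrary phases. The angles, the phases and all function names are illustrative assumptions, not measured values.

```python
import cmath
import math


def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def km_matrix(t12, t23, t13, delta):
    """Build V as the product of the five factors in (14.16)."""
    c12, s12 = math.cos(t12), math.sin(t12)
    c23, s23 = math.cos(t23), math.sin(t23)
    c13, s13 = math.cos(t13), math.sin(t13)
    half = cmath.exp(0.5j * delta)
    r23 = [[1, 0, 0], [0, c23, s23], [0, -s23, c23]]
    p_left = [[half.conjugate(), 0, 0], [0, 1, 0], [0, 0, half]]
    r13 = [[c13, 0, s13], [0, 1, 0], [-s13, 0, c13]]
    p_right = [[half, 0, 0], [0, 1, 0], [0, 0, half.conjugate()]]
    r12 = [[c12, s12, 0], [-s12, c12, 0], [0, 0, 1]]
    v = r23
    for factor in (p_left, r13, p_right, r12):
        v = matmul(v, factor)
    return v


def jarlskog(v):
    """J = Im(V11 V22 V12* V21*), equation (14.18)."""
    product = v[0][0] * v[1][1] * v[0][1].conjugate() * v[1][0].conjugate()
    return complex(product).imag


def jarlskog_formula(t12, t23, t13, delta):
    """Closed form (14.19)."""
    return (math.cos(t12) * math.cos(t13) ** 2 * math.cos(t23)
            * math.sin(t12) * math.sin(t13) * math.sin(t23) * math.sin(delta))


def rephase(v, row_phases, col_phases):
    """Multiply row i by exp(i a_i) and column j by exp(-i b_j)."""
    return [[cmath.exp(1j * (row_phases[i] - col_phases[j])) * v[i][j]
             for j in range(3)] for i in range(3)]


def unitarity_defect(v):
    """Largest deviation of V V^dagger from the unit matrix."""
    v_dagger = [[complex(v[j][i]).conjugate() for j in range(3)] for i in range(3)]
    prod = matmul(v, v_dagger)
    return max(abs(prod[i][j] - (1 if i == j else 0))
               for i in range(3) for j in range(3))


ANGLES = (0.23, 0.04, 0.004, 1.2)  # theta12, theta23, theta13, delta (illustrative)
V = km_matrix(*ANGLES)
V_rephased = rephase(V, (0.3, -1.1, 2.0), (0.7, 0.2, -0.5))

print("unitarity defect:", unitarity_defect(V))
print("J from (14.18):  ", jarlskog(V))
print("J from (14.19):  ", jarlskog_formula(*ANGLES))
print("J after rephasing:", jarlskog(V_rephased))
```

Setting $\delta = 0$, or any one of the three angles to zero, makes J vanish, in line with the reality conditions found in Section 14.3.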

Having fixed the KM matrix there remains only one global U(1) symmetry which
leaves it unchanged. All six quark fields, left and right, can be multiplied by the 
same phase factor. As a consequence, only the total quark number current and hence 
the total quark number is conserved. At the macroscopic level this is observed as 
baryon number conservation. 


14.4 CP symmetry and the KM matrix 


We shall now show that, if the KM matrix cannot be made real by a redefinition
of the quark phases, the Standard Model does not have CP (charge conjugation,
parity) symmetry.

We saw in Section 12.5 that the Weinberg-Salam electroweak theory is invariant 
under the CP operation. Similarly, CP is a symmetry of every term in the Standard 
Model of the weak and electromagnetic interactions of quarks, except for those 
terms that give the interaction between the quarks and the W bosons. These are the 
terms that involve the KM matrix. 

The CP transforms of the W fields are defined in equation (12.32): 
and the quark fields transform like all fermion fields:

$$q_L^{CP} = -i\sigma^2 q_L^*,\qquad q_R^{CP} = i\sigma^2 q_R^*.$$

To show how CP symmetry is violated, we consider the terms (14.15), which we
write as

$$\left(-e/\sqrt{2}\sin\theta_W\right)\sum_{i,j}\left[u_{Li}^\dagger\tilde\sigma^\mu V_{ij}d_{Lj}W^+_\mu + d_{Lj}^\dagger\tilde\sigma^\mu V^*_{ij}u_{Li}W^-_\mu\right]
\qquad (i = u, c, t;\ j = d, s, b).$$

Replacing the fields by their CP transforms gives

$$\left(-e/\sqrt{2}\sin\theta_W\right)\sum_{i,j}\left[-u_{Li}^T(\tilde\sigma^\mu)^T V_{ij}d^*_{Lj}W^-_\mu - d_{Lj}^T(\tilde\sigma^\mu)^T V^*_{ij}u^*_{Li}W^+_\mu\right],$$

where, as in Section 12.5, we have used the results

$$(\sigma^2)^2 = 1,\qquad \sigma^2\sigma^i\sigma^2 = -(\sigma^i)^T.$$

On transposing this expression with respect to the spinor indices we introduce a
minus sign from the anticommuting fermion fields, and obtain the CP transformed
expression

$$\left(-e/\sqrt{2}\sin\theta_W\right)\sum_{i,j}\left[d_{Lj}^\dagger\tilde\sigma^\mu V_{ij}u_{Li}W^-_\mu + u_{Li}^\dagger\tilde\sigma^\mu V^*_{ij}d_{Lj}W^+_\mu\right].$$

This is the same as the original term if and only if $V_{ij}$ is real for all $i, j$.

Experimental evidence for the breakdown of CP symmetry first became apparent 
in 1964, in the decay of the $K^0$ ($d\bar s$) meson. We shall discuss this decay and its
implications in Chapter 18, where we consider what is known experimentally about 
the parameters of the KM matrix. It is an interesting fact that CP-violating effects 
in the Standard Model are proportional to J. 


14.5 The weak interaction in the low energy limit 

Combining the results of Chapter 12 (equation (12.18)) with those of the present 
chapter (equation (14.15)), we have the complete interaction of the W bosons with 
all the fermions, both leptons and quarks, of the Standard Model: 

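$$\mathcal{L}_{W\text{int}} = \left(-e/\sqrt{2}\sin\theta_W\right)\left[j^{\mu\dagger}W^+_\mu + j^\mu W^-_\mu\right],$$

where

$$j^\mu = \sum_{\text{leptons}} e_L^\dagger\tilde\sigma^\mu\nu_L + \sum_{i,j} d_{Lj}^\dagger\tilde\sigma^\mu u_{Li}V^*_{ij}\qquad (i = u, c, t;\ j = d, s, b). \tag{14.20}$$

Note that we have suppressed colour indices in this chapter. The labels $i, j$ on the
quark spinors in (14.15) carry with them implied colour indices which are also
summed over.

By eliminating the W field as in Section 12.2, we obtain the low energy effective
interaction

$$\mathcal{L}_{W\text{eff}} = -2\sqrt{2}\,G_F\, j^\dagger_\mu j^\mu. \tag{14.21}$$

For example, the part of this effective interaction which is basically responsible for
all nuclear $\beta$ decays involves the electron field and the u and d quarks ($i = j = 1$):

$$\mathcal{L}_{\text{eff}} = -2\sqrt{2}\,G_F\left[g_{\mu\nu}\,\nu_{eL}^\dagger\tilde\sigma^\mu e_L\, d_L^\dagger\tilde\sigma^\nu u_L\, V^*_{ud} + \text{Hermitian conjugate}\right]. \tag{14.22}$$

That part of the effective interaction responsible for the decay
$K^0 \to \pi^+\pi^-$ ($\bar s \to \bar u + u + \bar d$) is

$$\mathcal{L}_{\text{eff}} = -2\sqrt{2}\,G_F\left[g_{\mu\nu}\,s_L^\dagger\tilde\sigma^\mu u_L\, u_L^\dagger\tilde\sigma^\nu d_L\, V^*_{us}V_{ud}\right]. \tag{14.23}$$

We have also the complete interaction of the Z boson with all the fermions.
Combining (12.23) with (14.14) gives

$$\mathcal{L}_{Z\text{int}} = -\frac{e}{\sin(2\theta_W)}\,(j_{\text{neutral}})_\mu Z^\mu, \tag{14.24}$$

where

$$(j_{\text{neutral}})^\mu = \sum_{\text{leptons}}\left[\nu_L^\dagger\tilde\sigma^\mu\nu_L - \cos(2\theta_W)\,e_L^\dagger\tilde\sigma^\mu e_L + 2\sin^2\theta_W\, e_R^\dagger\sigma^\mu e_R\right]$$
$$\qquad + \sum_{\text{quarks}}\Big[u_{Li}^\dagger\tilde\sigma^\mu u_{Li}\left(1 - \tfrac{4}{3}\sin^2\theta_W\right) - u_{Ri}^\dagger\sigma^\mu u_{Ri}\left(\tfrac{4}{3}\sin^2\theta_W\right)$$
$$\qquad\quad - d_{Li}^\dagger\tilde\sigma^\mu d_{Li}\left(1 - \tfrac{2}{3}\sin^2\theta_W\right) + d_{Ri}^\dagger\sigma^\mu d_{Ri}\left(\tfrac{2}{3}\sin^2\theta_W\right)\Big].$$

By eliminating the Z field, we obtain the low energy effective interaction

$$\mathcal{L}_{Z\text{eff}} = -\frac{G_F}{\sqrt{2}}\,(j_{\text{neutral}})_\mu (j_{\text{neutral}})^\mu. \tag{14.25}$$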
Problems

14.1 Verify that the transformations (14.3) along with (11.4b) and (11.6) leave $\mathcal{L}_{\text{dyn}}$
invariant.

14.2 Obtain $\mathcal{L}_{qZ}$ (equation (14.14)) from (14.4).

14.3 Show that $(\Phi^T\varepsilon L)$ is an SU(2) invariant. (Show that $\mathbf{U}^T\varepsilon\mathbf{U} = \varepsilon\det(\mathbf{U})$.)

14.4 Write down the interaction Lagrangian density between the quark fields and the
Higgs field, which appears in (14.6) and (14.9).

Estimate the coupling constant $c_t$ between the Higgs field and the top quark.

14.5 Which terms in (14.20) and (14.21) are responsible for the meson decays

$$K^+\,(u\bar s) \to \mu^+ + \nu_\mu,$$
$$D^+\,(c\bar d) \to \bar K^0\,(s\bar d) + e^+ + \nu_e,$$
$$B^+\,(u\bar b) \to \bar D^0\,(\bar c u) + \pi^+\,(u\bar d)?$$

Sketch appropriate quark diagrams.

14.6 There are no 'flavour changing neutral currents', i.e. there are no terms in the neutral
current of (14.24) that involve a change of quark flavour. Draw Feynman diagrams
from higher orders of perturbation theory that simulate the flavour changing neutral
current decays

$$b \to s + \gamma,\qquad b \to s + e^+ + e^-.$$
The hadronic decays of the Z and W bosons


In Chapter 13 we described the results on the leptonic decays of the Z boson,
obtained from experiments using $e^+e^-$ colliders. These results are in striking
agreement with the predictions of the Weinberg-Salam electroweak model. In this
chapter, we shall consider some of the wealth of data that has been accumulated at
CERN and SLAC on the hadronic decays of the Z, and we shall find equally
striking agreement between experiment and theory.


15.1 Hadronic decays of the Z 

In the Standard Model, a hadronic decay of the Z is most likely to be triggered by 
an initial decay to a quark-antiquark pair. The subsequent hadrons produced are 
mostly confined to two jets, back-to-back in the Z rest frame and made up of stable, 
or long lived, particles (see Fig. 15.1). The precise details of the processes involved 
in the creation of a jet are not fully understood. 

The momentum of a jet may be defined as the total momentum of the particles 
associated with it, and may be presumed to be equal to the momentum of the 
initiating quark or antiquark. The Z has sufficient rest energy to decay to any
quark-antiquark pair other than a $t\bar t$ pair, but it has so far not been possible to identify jets
as arising specifically from u, d or s quarks, or their antiquarks. However, many
b quark jets can be identified with some confidence from the recognition of $\bar B$
mesons ($b\bar u$, $b\bar d$), which have a high probability of being produced in b quark jets,
and a low probability of being produced in other jets. Similarly, B mesons are used
to identify $\bar b$ jets. The observation of charmed hadrons in jets has likewise been
used to identify jets arising from c quarks and $\bar c$ antiquarks.

Figure 15.1 A Z hadronic decay recorded by the OPAL detector at CERN. The
charged particle tracks can be seen in the inner region. The dark bands around
the outer circle indicate the angular distribution of energy deposited in the outer
calorimeter. The figure gives a projection of the event onto a plane perpendicular
to the beam axis (see Dydak (1990)).

Associating the observed jets with the initiating quarks, comparisons can be
made with the Standard Model predictions of Z decay rates to quark-antiquark
pairs. We shall first consider the decay of a Z that is in a definite spin state. The
interaction Lagrangian (14.4) has the same form for the d, s and b quarks, and in
the lowest order of perturbation theory gives a differential decay rate into a $d_k\bar d_k$
pair ($d_1 = d$, $d_2 = s$, $d_3 = b$)

$$\frac{d\Gamma(d_k\bar d_k)}{d\cos\theta} = \frac{3G_F M_Z^3}{32\sqrt{2}\,\pi}\left[\left(1 - \tfrac{2}{3}\sin^2\theta_W\right)^2(1 - \cos\theta)^2 + \left(\tfrac{2}{3}\sin^2\theta_W\right)^2(1 + \cos\theta)^2\right], \tag{15.1}$$

where $\theta$ is the angle between the direction of the $d_k$ quark momentum and the
direction of the Z spin. Similarly, the decay rate to a $u\bar u$ or $c\bar c$ pair is

$$\frac{d\Gamma(u_k\bar u_k)}{d\cos\theta} = \frac{3G_F M_Z^3}{32\sqrt{2}\,\pi}\left[\left(1 - \tfrac{4}{3}\sin^2\theta_W\right)^2(1 - \cos\theta)^2 + \left(\tfrac{4}{3}\sin^2\theta_W\right)^2(1 + \cos\theta)^2\right]. \tag{15.2}$$
The colour factor of 3 is included in these rates. Terms in $m_q/M_Z$ are neglected.
Integrating over $\theta$ gives the total decay rates

$$\Gamma(d_k\bar d_k) = \frac{G_F M_Z^3}{4\sqrt{2}\,\pi}\left[1 - \frac{4}{3}\sin^2\theta_W + \frac{8}{9}\sin^4\theta_W\right] = 0.3677\ \text{GeV}, \tag{15.3}$$

$$\Gamma(u_k\bar u_k) = \frac{G_F M_Z^3}{4\sqrt{2}\,\pi}\left[1 - \frac{8}{3}\sin^2\theta_W + \frac{32}{9}\sin^4\theta_W\right] = 0.2853\ \text{GeV}. \tag{15.4}$$

These numbers are obtained taking sin 2 0 w = 0.2315 (see Section 11.4). Adding 
the decay rates to all pairs gives a total decay rate 


T q q = 1.6737 GeV. 

This lowest order calculation is in quite good agreement with the experimental total
hadronic decay rate, which is

$$\Gamma_{\text{experiment}} = 1.741 \pm 0.006\ \text{GeV}.$$

At the high energy of the Z boson, the effects of the strong interaction can be
estimated with some confidence (Chapter 17). When additional gluon radiation is
taken into account, the theoretical $\Gamma_{q\bar{q}}$ is modified by a factor $f = 1.038$, and gives

$$\Gamma_{\text{theoretical}} = f\,\Gamma_{q\bar{q}} = 1.737\ \text{GeV},$$

in very close agreement with experiment.

The identification of $b\bar{b}$ jets and (less precisely) $c\bar{c}$ jets enables these partial
decay modes also to be compared with the Standard Model. The estimates from
experiment are

$$\Gamma(b\bar{b}) = 0.385 \pm 0.006\ \text{GeV},$$
$$\Gamma(c\bar{c}) = 0.275 \pm 0.025\ \text{GeV}.$$

The Standard Model values, (15.3) and (15.4) corrected by the factor $f$, are

$$\Gamma(b\bar{b})\ (\text{theoretical}) = 0.3817\ \text{GeV},$$
$$\Gamma(c\bar{c})\ (\text{theoretical}) = 0.2961\ \text{GeV}.$$


The agreement between theory and experiment is satisfactory. 
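
The numbers quoted above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming the integrated form of (15.1) and (15.2), in which the bracket becomes $(1 - x)^2 + x^2$ with $x = 2|Q_q|\sin^2\theta_w$. The values of $G_F$ and $M_Z$ are inputs we have assumed here (they are not given in this section), and the function name `z_width` is purely illustrative.

```python
import math

G_F = 1.16637e-5        # Fermi constant in GeV^-2 (assumed input, not from the text)
M_Z = 91.1876           # Z mass in GeV (assumed input, not from the text)
SIN2_THETA_W = 0.2315   # value used in the text
F_QCD = 1.038           # gluon-radiation factor f quoted in the text


def z_width(charge_magnitude, s=SIN2_THETA_W):
    """Lowest-order width in GeV for Z -> q qbar, colour factor 3 included.

    charge_magnitude is |Q_q|: 1/3 for d, s, b and 2/3 for u, c.
    Quark masses are neglected, as in the text.
    """
    x = 2.0 * charge_magnitude * s
    prefactor = G_F * M_Z**3 / (4.0 * math.sqrt(2.0) * math.pi)
    return prefactor * ((1.0 - x) ** 2 + x**2)


gamma_d = z_width(1.0 / 3.0)             # d, s or b pair
gamma_u = z_width(2.0 / 3.0)             # u or c pair
gamma_qq = 3 * gamma_d + 2 * gamma_u     # t quark is too heavy to contribute
gamma_theory = F_QCD * gamma_qq

print(f"d-type: {gamma_d:.4f} GeV, u-type: {gamma_u:.4f} GeV")
print(f"sum: {gamma_qq:.4f} GeV, with gluon radiation: {gamma_theory:.3f} GeV")
```

With these assumed inputs the sketch gives values that agree with (15.3), (15.4) and the corrected total to the precision quoted in the text.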


15.2 Asymmetry in quark production 

We noted in Section 13.6 that the SLC electron beam can be polarised to produce 
Z bosons with a much higher degree of polarisation than those produced at CERN 
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by unpolarised beams. From (15.1) there is a forward-backward asymmetry, with 
respect to the Z spin direction, in the angular distribution of b quarks in a $b\bar{b}$ pair
produced by Z decay, given by

$$\frac{\Delta\Gamma}{\Gamma} = \frac{\Gamma(0 < \theta < \pi/2) - \Gamma(\pi/2 < \theta < \pi)}{\Gamma(0 < \theta < \pi/2) + \Gamma(\pi/2 < \theta < \pi)}.$$

Taking $\sin^2\theta_w = 0.2315$ gives $\Delta\Gamma/\Gamma = -0.7016$. At the peak of the Z mass
distribution electromagnetic interference effects are very small, and one can expect a
forward-backward asymmetry in the b quark jets relative to the electron beam
direction. Measurements of b quark jets at SLC give a value of $\Delta\Gamma/\Gamma = -0.630 \pm 0.075$
(Prescott, 1996).

At LEP the Zs produced in $e^+e^-$ collisions are polarised along the direction of
the electron beam with polarisation $P$, to give a forward-backward asymmetry of
b quark jets with respect to the electron beam direction of

$$A^b_{FB} = P\,\frac{\Delta\Gamma}{\Gamma}.$$

From Section 13.6, taking $\sin^2\theta_w = 0.2315$ gives $P = -A_{LR} = -0.148$, so that

$$A^b_{FB}\ (\text{theory}) = 0.104.$$

The experimental value (Renton, 1996) is

$$A^b_{FB}\ (\text{experimental}) = 0.0997 \pm 0.0031.$$

The corresponding numbers for the c quark jets are

$$A^c_{FB}\ (\text{theory}) = 0.0719,$$
$$A^c_{FB}\ (\text{experimental}) = 0.0729 \pm 0.0058.$$


Again the Standard Model and experiment are in accord. 
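
As a check on the b quark numbers, the asymmetry can be evaluated directly from the angular distribution (15.1). The sketch below assumes that the two terms of (15.1) carry weights $(1 - x)^2$ and $x^2$ with $x = \tfrac{2}{3}\sin^2\theta_w$, and that the LEP asymmetry is the product $P\,\Delta\Gamma/\Gamma$; the function name is illustrative only.

```python
SIN2_THETA_W = 0.2315   # value used in the text
P_LEP = -0.148          # Z polarisation at LEP quoted in the text


def spin_asymmetry(charge_magnitude, s=SIN2_THETA_W):
    """Forward-backward asymmetry with respect to the Z spin direction.

    Uses the integrals over c = cos(theta):
      forward  (0 < c < 1):  (1 - c)^2 -> 1/3,  (1 + c)^2 -> 7/3
      backward (-1 < c < 0): (1 - c)^2 -> 7/3,  (1 + c)^2 -> 1/3
    """
    x = 2.0 * charge_magnitude * s
    a = (1.0 - x) ** 2   # weight of the (1 - cos theta)^2 term
    b = x**2             # weight of the (1 + cos theta)^2 term
    forward = a / 3.0 + 7.0 * b / 3.0
    backward = 7.0 * a / 3.0 + b / 3.0
    return (forward - backward) / (forward + backward)


asym_b = spin_asymmetry(1.0 / 3.0)   # b quark, |Q| = 1/3
a_fb_b = P_LEP * asym_b              # asymmetry relative to the electron beam

print(f"Delta Gamma / Gamma (b) = {asym_b:.4f}")
print(f"A_FB(b) at LEP          = {a_fb_b:.3f}")
```

The closed form behind the function is $\Delta\Gamma/\Gamma = -\tfrac{3}{4}(a - b)/(a + b)$, which makes explicit that interchanging the left- and right-handed weights reverses the sign of the asymmetry but leaves the total width unchanged.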

A significant aspect of these asymmetry measurements is that an assignment of 
the right-handed rather than the left-handed quark fields to the SU(2) doublet would
lead to an asymmetry of opposite sign. (The total widths would be unaffected.) The 
results vindicate the left-handed assignment. 

15.3 Hadronic decays of the W ± 

The $e^+e^-$ colliders give a clean source of Z bosons, but there is as yet no clean
source of $W^\pm$ bosons. Consequently the experimental data on $W^\pm$ decays is less
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precise than that for Z decay. The hadronic decays of a are, in its rest frame, 
like those of the Z: principally into two back-to-back jets, which are interpreted as 
the signatures of the initiating quark-antiquark pairs. 

Consider for example the decay of the $W^+$ to a quark $u_i$ ($u_1 = u$, $u_2 = c$) and an
antiquark $\bar{d}_j$ ($d_1 = d$, $d_2 = s$, $d_3 = b$). The coupling of the $W^+$ to the quark fields is
given by $\mathcal{L}_{qW}$ (equation (14.15)), and depends on the elements $V_{ij}$ of the Kobayashi-
Maskawa matrix. In the lowest order of perturbation theory, and neglecting quark
masses, the differential decay rate to a pair $u_i\bar{d}_j$ is

$$\frac{d\Gamma_{ij}}{d\cos\theta} = \frac{3G_F M_W^3}{16\sqrt{2}\,\pi}\,|V_{ij}|^2(1 - \cos\theta)^2, \tag{15.5}$$



where $\theta$ is the angle between the direction of the $u_i$ momentum and the direction
of the $W^+$ spin. Integrating over $\theta$ gives the total decay rate

$$\Gamma(W^+ \to u_i\bar{d}_j) = \frac{G_F M_W^3}{2\sqrt{2}\,\pi}\,|V_{ij}|^2 = (0.677 \pm 0.006)\,|V_{ij}|^2\ \text{GeV}. \tag{15.6}$$


There is no data that resolves both initiating quark jets, so that we have no
information from W decay on individual components of the KM matrix. However, we
can sum over $j$, and since the KM matrix is unitary

$$\sum_{j=1}^{3}|V_{ij}|^2 = \sum_{j=1}^{3}V_{ij}V^*_{ij} = \sum_{j=1}^{3}V_{ij}(V^\dagger)_{ji} = 1 \quad \text{for } i = 1, 2, 3.$$


Then summing over the possible $u_i$, the u and c quarks, and including the factor $f$,
we have

$$\Gamma(\text{all possible } q\bar{q}' \text{ pairs}) = f\,\frac{G_F M_W^3}{\sqrt{2}\,\pi} = 1.41 \pm 0.008\ \text{GeV}.$$

This value is in close agreement with the observed hadronic decay rate of the $W^+$:

$$\Gamma(\text{hadronic}) = 1.44 \pm 0.04\ \text{GeV}.$$

Also, c quark jets can be identified with some confidence. From the above we would
expect

$$\frac{\Gamma(\text{all possible } c\bar{q}' \text{ pairs})}{\Gamma(\text{all possible } q\bar{q}' \text{ pairs})} = \frac{1}{2},$$

close to the measured value 0.51 ± 0.08.
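
The W widths follow from (15.6) and unitarity alone. The sketch below is illustrative: $G_F$ and $M_W$ are assumed inputs (the text does not state the W mass it uses, so the result can differ from the quoted 0.677 GeV at the per cent level), and `w_partial_width` is a hypothetical helper name.

```python
import math

G_F = 1.16637e-5   # Fermi constant in GeV^-2 (assumed input)
M_W = 80.4         # W mass in GeV (assumed input; the text's value may differ slightly)
F_QCD = 1.038      # gluon-radiation factor f quoted in the text


def w_partial_width(v_ij_squared):
    """Lowest-order width in GeV for W+ -> u_i dbar_j, equation (15.6)."""
    return G_F * M_W**3 / (2.0 * math.sqrt(2.0) * math.pi) * v_ij_squared


# Unitarity: summing |V_ij|^2 over j gives 1 for each up-type quark i.
width_per_up_type = w_partial_width(1.0)

# Only u and c are light enough, so there are two such rows.
hadronic_width = F_QCD * 2 * width_per_up_type
charm_fraction = width_per_up_type / (2 * width_per_up_type)

print(f"per up-type quark: {width_per_up_type:.3f} GeV")
print(f"hadronic width:    {hadronic_width:.2f} GeV")
print(f"charm fraction:    {charm_fraction}")
```

Note that the individual $|V_{ij}|^2$ never enter: only the row sums do, which is why these measurements test unitarity without fixing the matrix elements.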

In conclusion, it would seem that we have no reason to doubt the efficacy of the
Standard Model in describing the interactions of the Z and $W^\pm$ bosons with both
leptons and quarks. The details of the KM matrix $V_{ij}$ remain undetermined by these
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experiments, but it does pass two tests of unitarity. We have to rely on lower energy 
hadron physics to investigate the KM matrix more thoroughly, as will be discussed 
in Chapter 18. 


Problems 

15.1 Obtain the decay rates (15.3), (15.4) and (15.6). Note that quark masses have been 
neglected in these expressions (cf. Problem 13.3). 
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The theory of strong interactions: quantum 
chromodynamics 


The basic features of the quark model of hadrons were set out in Chapter 1. Quarks
carry a colour index, and interact with the gluon fields which mediate the strong 
interaction. 

We have seen that in the Standard Model the electromagnetic interaction and 
the weak interaction are well described by gauge theories. In the Standard Model 
the strong interaction also is described by a gauge theory. In this chapter we show 
how this is done. The theory is known as quantum chromodynamics (QCD) and 
has the remarkable property that in the theory quarks are confined, as appears to be 
the case experimentally (Section 1.4). In this chapter we concentrate exclusively 
on the strong interaction. The electromagnetic and weak interactions of quarks are 
neglected. 


16.1 A local SU(3) gauge theory


In QCD, we have three fields for each flavour of quark. These are put into so-called
colour triplets. For example the u quark is associated with the triplet

$$u = \begin{pmatrix} u_r \\ u_g \\ u_b \end{pmatrix}$$

where $u_r$, $u_g$, $u_b$ are four-component Dirac spinors, and the subscripts r, g, b label
the colour states (red, green, blue, say).

We then postulate that the theory is invariant under a local SU(3) transformation

$$q \to q' = Uq \tag{16.1a}$$

where $q$ is any quark triplet, and $U$ is any space- and time-dependent element of
the group SU(3). The mathematical steps follow those of the SU(2) theory of the
weak interaction of leptons. We introduce a 3 × 3 matrix gauge field $G_\mu$, which
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is the analogue of the matrix field of the electroweak theory. Under an 5(7(3) 
transformation,

$$G_\mu \to G'_\mu = UG_\mu U^\dagger + (i/g)(\partial_\mu U)U^\dagger. \tag{16.1b}$$

We define

$$D_\mu q = (\partial_\mu + igG_\mu)q. \tag{16.2}$$

It follows that under a local SU(3) transformation

$$D'_\mu q' = UD_\mu q \tag{16.3}$$

where $D'_\mu q' = (\partial_\mu + igG'_\mu)q'$. The parameter $g$ that appears in these equations
is the strong coupling constant.

$G_\mu$ is taken to be Hermitian and traceless, like $W_\mu$ in the electroweak theory,
and hence it can be expressed in terms of the eight matrices $\lambda_a$ set out in Appendix
B, Section B.7:

$$G_\mu = \frac{1}{2}\sum_{a=1}^{8}G^a_\mu\lambda_a, \tag{16.4}$$

where the coefficients $G^a_\mu(x)$ are eight real independent gluon gauge fields. (The
factor $\tfrac{1}{2}$ is conventional.)

The Yang-Mills construction (cf. Section 11.2),

$$G_{\mu\nu} = \partial_\mu G_\nu - \partial_\nu G_\mu + ig(G_\mu G_\nu - G_\nu G_\mu), \tag{16.5}$$

leads to the result that, under SU(3) transformations of the form (16.1b),

$$G'_{\mu\nu} = UG_{\mu\nu}U^\dagger. \tag{16.6}$$

The gluon Lagrangian density is taken to be

$$\mathcal{L}_{\text{gluon}} = -\frac{1}{2}\text{Tr}[G_{\mu\nu}G^{\mu\nu}]. \tag{16.7}$$

It follows from (16.6) and the cyclic invariance of the trace that $\mathcal{L}_{\text{gluon}}$ is gauge
invariant.
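
For completeness, the one-line argument can be written out. This is a sketch that uses only the transformation law $G_{\mu\nu} \to UG_{\mu\nu}U^\dagger$, the unitarity of $U$, and the cyclic property of the trace:

```latex
\begin{aligned}
\mathrm{Tr}\bigl[G'_{\mu\nu}G'^{\mu\nu}\bigr]
  &= \mathrm{Tr}\bigl[U G_{\mu\nu} U^\dagger\, U G^{\mu\nu} U^\dagger\bigr]
   && \text{(transformation law)} \\
  &= \mathrm{Tr}\bigl[U G_{\mu\nu} G^{\mu\nu} U^\dagger\bigr]
   && (U^\dagger U = 1) \\
  &= \mathrm{Tr}\bigl[U^\dagger U\, G_{\mu\nu} G^{\mu\nu}\bigr]
   && \text{(cyclic invariance)} \\
  &= \mathrm{Tr}\bigl[G_{\mu\nu} G^{\mu\nu}\bigr].
\end{aligned}
```

The same steps fail for a mass term of the form $\mathrm{Tr}[G_\mu G^\mu]$, because $G_\mu$ transforms inhomogeneously under (16.1b).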

We can expand $G_{\mu\nu}$ in terms of its ‘components’,

$$G_{\mu\nu} = \frac{1}{2}\sum_{a=1}^{8}G^a_{\mu\nu}\lambda_a, \tag{16.8}$$

using equation (B.27) of Appendix B. Hence, using also the property (B.28), that

$$\text{Tr}(\lambda_a\lambda_b) = 2\delta_{ab},$$
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the gluon Lagrangian density becomes 

$$\mathcal{L}_{\text{gluon}} = -\frac{1}{4}\sum_{a=1}^{8}G^a_{\mu\nu}G^{a\mu\nu}. \tag{16.9}$$

The quark Lagrangian density is taken to be of the standard Dirac form (equation
(7.7)):

$$\mathcal{L}_{\text{quark}} = \sum_{f=1}^{6}\left[\bar{q}_f\,i\gamma^\mu D_\mu q_f - m_f\bar{q}_f q_f\right], \tag{16.10}$$

where the sum is over all flavours of quark and $m_f$ are the ‘true’ quark masses
defined in Section 14.2. $\mathcal{L}_{\text{quark}}$ is evidently invariant under an SU(3) transformation
(using (16.3)). The reader should note here the very compact notation that has
been developed: as well as the explicit sum over flavours, there are sums over
colour indices and sums over the indices of the four-component Dirac spinor and $\gamma$
matrices. It is perhaps instructive for the reader to write out the expression in full.
The total strong interaction Lagrangian density is

$$\mathcal{L}_{\text{strong}} = \mathcal{L}_{\text{gluon}} + \mathcal{L}_{\text{quark}}. \tag{16.11}$$

The eight gluon gauge fields have no mass terms. There is no direct coupling of 
the gluon fields to the Higgs field. The Higgs field is relevant in that it gives mass 
to the quarks. The field equations follow from Hamilton’s principle of stationary 
action. For the six quark triplets we easily obtain (cf. Section 5.5) 

$$(i\gamma^\mu D_\mu - m_f)q_f = 0. \tag{16.12}$$

For the eight gluon fields, variation of the Lagrangian density with respect to the
field $G^a_\nu$ gives (cf. Section 4.2)

$$\partial_\mu G^{a\mu\nu} = j^{a\nu} \tag{16.13}$$

where

$$j^{a\nu} = g\Bigl[f_{abc}\,G^b_\mu G^{c\mu\nu} + \sum_f \bar{q}_f\gamma^\nu(\lambda_a/2)q_f\Bigr]. \tag{16.14}$$

Here $f_{abc}$ are the SU(3) structure constants, defined by

$$[\lambda_a, \lambda_b] = \lambda_a\lambda_b - \lambda_b\lambda_a = 2i\sum_{c=1}^{8}f_{abc}\lambda_c. \tag{16.15}$$

(See Appendix B, Section B.7.) Their appearance here stems from the definition
(16.5) of $G_{\mu\nu}$.
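
The structure constants can be generated numerically from the defining relation (16.15). The sketch below assumes the standard Gell-Mann form of the eight matrices $\lambda_a$ (the text defers their explicit form to Appendix B, so the representation written here is our assumption) and uses $f_{abc} = \text{Tr}([\lambda_a, \lambda_b]\lambda_c)/4i$, which follows from (16.15) together with $\text{Tr}(\lambda_a\lambda_b) = 2\delta_{ab}$. All helper names are illustrative.

```python
import math

S = 1.0 / math.sqrt(3.0)

# Standard Gell-Mann matrices lambda_1 ... lambda_8 (assumed representation).
LAMBDA = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[S, 0, 0], [0, S, 0], [0, 0, -2 * S]],
]


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def trace(a):
    return sum(a[i][i] for i in range(3))


def structure_constant(a, b, c):
    """f_abc for 1-based indices a, b, c, from Tr([l_a, l_b] l_c) / 4i."""
    la, lb, lc = LAMBDA[a - 1], LAMBDA[b - 1], LAMBDA[c - 1]
    ab, ba = matmul(la, lb), matmul(lb, la)
    comm = [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]
    return (trace(matmul(comm, lc)) / 4j).real


print(structure_constant(1, 2, 3))   # f_123
print(structure_constant(4, 5, 8))   # f_458
```

Because $f_{abc}$ is non-zero, the term $f_{abc}G^b_\mu G^{c\mu\nu}$ in (16.14) does not vanish: the gluon fields themselves carry colour current, unlike the photon in QED.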
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Since G'' 1 ' = — G l ’ M it follows that 


$$\partial_\nu j^{a\nu} = \partial_\nu\partial_\mu G^{a\mu\nu} = 0, \tag{16.16}$$

and we have eight conserved currents. These are the Noether currents, which are a
consequence of the SU(3) symmetry taken as a global symmetry. We therefore have
eight constants of the motion, associated with the time-independent operators

$$Q^a = \int j^{a0}\,d^3x. \tag{16.17}$$


The field equations, and in particular the gluon field equations, are non-linear, 
like the equations of the electroweak theory. It is clear from (16.14) that both the 
quarks and the gluon fields themselves contribute to the currents j av which are the 
sources of the gluon fields. The quarks interact through the mediation of the gluon 
fields; the gluon fields are also self-interacting. 

Since the gluon fields are massless we might anticipate colour forces to be long 
range, which appears inconsistent with the short range of the strong interaction. 
However, the fields are known to be confining on a length scale greater than about
$10^{-15}$ m = 1 fm: neither free quarks nor free ‘gluons’ have ever been observed.

In the electroweak theory, the ‘free field’ approximation in which all coupling 
constants are set to zero is the basis for the successful perturbation calculations we 
have seen in the preceding chapters. The free field approximation for quarks and 
gluons is not a good starting point for calculations in QCD, except on the scale of 
very small distances (< 0.1 fm) or very high energies ( > 10 GeV). For low energy 
physics, the equations of the theory are analytically highly intractable. Even the 
vacuum state is characterised by complicated field configurations that have so far 
defied analysis. There is no analytical proof of confinement. Confinement is not
displayed in perturbation theory, but numerical simulations demonstrate convincingly
that QCD has this necessary property for an acceptable theory.

16.2 Colour gauge transformations on baryons and mesons 

Since colour symmetry plays such an important part in the theory of strong
interactions, it is natural to ask why it is not readily apparent in the particles, baryons and
mesons, formed from quarks by the strong interaction. Here we attempt to answer
that question.

In Section 1.4 we asserted that baryons are essentially made up of three quarks,
and mesons are essentially quark-antiquark pairs. We shall denote a three-quark
state in which quark 1 is in colour state $i$, quark 2 is in colour state $j$, and quark 3 is
in colour state $k$ by $|i, j, k\rangle$, and take the colour indices to be the numbers 1, 2, 3.
We have suppressed all other aspects (position, spin, flavour) of the quarks. In
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Section 1.7 we saw that the Pauli principle required baryon states to be antisym¬ 
metric in the interchange of colour indices. The only antisymmetric combination 
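of colour states we can construct is

$$|\text{state}\rangle = (1/\sqrt{6})\,\varepsilon_{ijk}\,|i, j, k\rangle, \tag{16.18}$$

where $\varepsilon_{ijk}$ is defined by:

$$\varepsilon_{123} = \varepsilon_{231} = \varepsilon_{312} = -\varepsilon_{132} = -\varepsilon_{321} = -\varepsilon_{213} = 1,$$

and $\varepsilon_{ijk} = 0$ if any two of $i, j, k$ are the same. $(1/\sqrt{6})$ is a normalisation factor.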

How does this state transform under a colour SU(3) transformation? We restrict
the discussion to a global (space- and time-independent) transformation, since a
baryon is an object extended in space. We consider the quark fields to be
transformed by $q \to q' = Uq$. In quantum field theory, these fields destroy quarks
and create antiquarks. It follows that under the transformation the baryon state
(16.18) will transform as

$$|\text{state}\rangle \to |\text{state}\rangle' = (1/\sqrt{6})\,|a, b, c\rangle\,U^*_{ai}U^*_{bj}U^*_{ck}\,\varepsilon_{ijk}.$$

But $\varepsilon_{ijk}U^*_{ai}U^*_{bj}U^*_{ck} = \varepsilon_{abc}\det U^* = \varepsilon_{abc}$, since the determinant of an SU(3) matrix
is 1. Thus we have the important result that under an SU(3) transformation,
$|\text{state}\rangle' = |\text{state}\rangle$. The transformation of the state is a trivial multiplication by unity.
The state is said to be a colour singlet.

Turning now to the mesons, we denote a state of a quark, colour $i$, and an
antiquark of colour $j$ by $|i, \bar{j}\rangle$. Again, we have suppressed all other aspects of the
quarks. Meson states are linear combinations

$$|\text{meson}\rangle = (1/\sqrt{3})\bigl(|1, \bar{1}\rangle + |2, \bar{2}\rangle + |3, \bar{3}\rangle\bigr). \tag{16.19}$$

Under an SU(3) transformation,

$$|\text{meson}\rangle \to |\text{meson}\rangle' = (1/\sqrt{3})\,|a, \bar{b}\rangle\,U^*_{ai}U_{bi}.$$

But $U^*_{ai}U_{bi} = U_{bi}U^\dagger_{ia} = \delta_{ab}$, so that

$$|\text{meson}\rangle' = |\text{meson}\rangle.$$

The meson states, like the baryon states, are colour singlets. 
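
Both singlet identities are easy to confirm numerically for a particular group element. The sketch below builds one arbitrary SU(3) matrix as a product of two embedded complex rotations and a diagonal phase matrix of unit determinant (the angles are arbitrary choices of ours, and all function names are illustrative), then checks $\varepsilon_{ijk}U^*_{ai}U^*_{bj}U^*_{ck} = \varepsilon_{abc}$ and $U^*_{ai}U_{bi} = \delta_{ab}$.

```python
import cmath
import math
from itertools import product

IDX = range(3)


def levi_civita(i, j, k):
    """epsilon_ijk for indices 0, 1, 2 (equals +1, -1 or 0)."""
    return (i - j) * (j - k) * (k - i) // 2


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in IDX) for j in IDX] for i in IDX]


def su3_example():
    """One SU(3) matrix: each factor is unitary with determinant 1."""
    c1, s1 = math.cos(0.7), math.sin(0.7)
    c2, s2 = math.cos(1.1), math.sin(1.1)
    r12 = [[c1, s1 * cmath.exp(0.3j), 0],
           [-s1 * cmath.exp(-0.3j), c1, 0],
           [0, 0, 1]]
    r23 = [[1, 0, 0],
           [0, c2, s2 * cmath.exp(-0.5j)],
           [0, -s2 * cmath.exp(0.5j), c2]]
    # Phases 0.4 - 1.3 + 0.9 = 0, so the determinant is exactly 1.
    phases = [[cmath.exp(0.4j), 0, 0],
              [0, cmath.exp(-1.3j), 0],
              [0, 0, cmath.exp(0.9j)]]
    return matmul(matmul(r12, r23), phases)


def baryon_deviation(u):
    """Largest |eps_ijk U*_ai U*_bj U*_ck - eps_abc| over all a, b, c."""
    worst = 0.0
    for a, b, c in product(IDX, repeat=3):
        total = sum(levi_civita(i, j, k)
                    * u[a][i].conjugate() * u[b][j].conjugate() * u[c][k].conjugate()
                    for i, j, k in product(IDX, repeat=3))
        worst = max(worst, abs(total - levi_civita(a, b, c)))
    return worst


def meson_deviation(u):
    """Largest |U*_ai U_bi - delta_ab| over all a, b."""
    worst = 0.0
    for a, b in product(IDX, repeat=2):
        total = sum(u[a][i].conjugate() * u[b][i] for i in IDX)
        worst = max(worst, abs(total - (1.0 if a == b else 0.0)))
    return worst


U = su3_example()
print(baryon_deviation(U), meson_deviation(U))   # both at rounding-error level
```

The baryon check relies on $\det U = 1$, whereas the meson check needs only unitarity; a U(3) matrix with a non-trivial overall phase would still leave the meson state invariant but would multiply the baryon state by a phase.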

In the quark model, we see that colour transformations have no effect on the
observed particles. It can also be shown that the eight gluon colour operators $Q^a$,
defined by (16.17), give zero when they act on these states. Thus the SU(3) symmetry
is well hidden by Nature: the particles are blind to the transformation of colour
symmetry. These observations can be related to lattice QCD, in which calculations
indicate that all the allowed states of the theory have this property.
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16.3 Lattice QCD and asymptotic freedom 


Numerical simulations of QCD replace continuous space-time by a finite but large 
four-dimensional space and time lattice of points. The quark and gluon fields are 
only defined at these points. Sophisticated computer programs have been written 
that are capable of handling the lattice. Gluon fields are commuting boson fields. 
The quark fields are anticommuting fermion fields and pose a technically much 
more difficult numerical problem. In fact the first lattice calculations were done 
neglecting all quark fields, even those of the light u and d quarks, and thus excluding 
all effects of virtual quark pair creation and annihilation. In this so-called quenched 
approximation the Lagrangian density is taken to be the $\mathcal{L}_{\text{gluon}}$ of (16.9). $\mathcal{L}_{\text{gluon}}$
displays confinement at distances greater than about a fermi.

At shorter distances, less than about 0.2 fermi, both $\mathcal{L}_{\text{gluon}}$ and the full QCD
Lagrangian density display another important property, known as asymptotic
freedom. The effective strong interaction coupling constant becomes so small at short
distances that quarks and gluons can be considered as approximately free, and their
interactions can be treated in perturbation theory.

To set the scene for the discussion of the effective ‘running’ strong interaction 
coupling constant, we first discuss the case of electromagnetism. 

At atomic distances $\sim 10^{-10}$ m, the electrostatic interaction between an electron
and a positron is given by the Coulomb energy $V(r) = -e^2/4\pi r$. In the lowest order
of perturbation theory, the amplitude for electron-positron Coulomb scattering is
proportional to the Fourier transform $\tilde{V}(Q^2)$ of $V(r)$,

$$\tilde{V}(Q^2) = \int V(r)\,e^{i\mathbf{Q}\cdot\mathbf{r}}\,d^3\mathbf{r} = -e^2/Q^2, \tag{16.20}$$

where $\mathbf{Q}$ is the momentum transfer in the centre of mass system.

In QED, this result is modified by quantum corrections: virtual $e^+e^-$ pairs created
from the vacuum are polarised by the electric field of a charge, so that its measured
charge at atomic distances is a ‘bare’ charge screened by virtual $e^+e^-$ pairs. At
short distances the screening is reduced, so that the effective charge is greater.
Perturbation calculations in QED that include vacuum polarisation effects (Fig. 16.1)
show that at large $Q^2$, (16.20) is modified to

$$\tilde{V}(Q^2) = -\frac{e^2}{Q^2}\,\frac{1}{1 - (e^2/12\pi^2)\ln(Q^2/4m^2)} \tag{16.21}$$


where $m$ is the electron mass. This result holds for large $Q^2 \gg 4m^2$ (but not so
large $Q^2$ that the denominator vanishes!). Thus at large $Q^2$ we have an effective
coupling constant

$$\alpha(Q^2) = \frac{e^2(Q^2)}{4\pi} = \frac{(e^2/4\pi)}{1 - (e^2/12\pi^2)\ln(Q^2/4m^2)}, \tag{16.22}$$
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Figure 16.1 (a) The lowest order Feynman diagram representing single photon 
exchange. The corresponding perturbation calculation reproduces the result of 
(16.20). (b) The lowest order modification due to vacuum polarisation. Including 
this effect gives, at large Q 2 /m 2 , the result of (16.21). 


which increases as $Q^2$ increases (or, equivalently, as we probe shorter distances).
Because $e^2/12\pi^2 \sim 10^{-3}$ the effects of vacuum polarisation are small, but in atomic
physics they have been calculated and measured with high precision.
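
To see how slowly the effective coupling runs, one can evaluate the large-$Q^2$ formula for the effective coupling, $\alpha(Q^2) = \alpha/[1 - (\alpha/3\pi)\ln(Q^2/4m^2)]$ with $e^2 = 4\pi\alpha$, at a high-energy scale. The sketch below assumes the familiar low-energy value $\alpha \approx 1/137.036$ and includes only the electron loop, as in the formula above; the function name and the choice of $Q$ are illustrative.

```python
import math

ALPHA_0 = 1.0 / 137.036   # low-energy fine-structure constant (assumed input)
M_E = 0.000511            # electron mass in GeV (assumed input)


def alpha_qed(q_gev, alpha=ALPHA_0, m=M_E):
    """Effective coupling at momentum transfer q (GeV), electron loop only.

    Valid only for q^2 >> 4 m^2 and well below the scale where the
    denominator vanishes.
    """
    e_squared = 4.0 * math.pi * alpha
    log_term = math.log(q_gev**2 / (4.0 * m**2))
    return alpha / (1.0 - e_squared / (12.0 * math.pi**2) * log_term)


coefficient = 4.0 * math.pi * ALPHA_0 / (12.0 * math.pi**2)   # e^2 / 12 pi^2
alpha_at_z = alpha_qed(91.19)                                  # Q near the Z mass

print(f"e^2/(12 pi^2) = {coefficient:.2e}")
print(f"1/alpha at Q = 91.19 GeV: {1.0 / alpha_at_z:.1f}")
```

Even over five orders of magnitude in $Q$ the electron loop alone changes $1/\alpha$ by only a couple of per cent, which is the sense in which vacuum polarisation effects in QED are small.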

Similar vacuum polarisation effects occur in QCD, but the coupling is much
larger and the consequences are more dramatic. If the scattering of a quark and
an antiquark is calculated to the same order of perturbation theory as that used to
obtain (16.22), then at large $Q^2$ the effective strong coupling constant $\alpha_s(Q^2)$ is
(see Close, 1979, p. 217)

$$\alpha_s(Q^2) = \frac{g^2(Q^2)}{4\pi} = \frac{g^2/4\pi}{1 + (g^2/16\pi^2)[11 - (2/3)n_f]\ln(Q^2/\lambda^2)}. \tag{16.23}$$

In this expression $\lambda$ is a parameter with the dimensions of energy that replaces
the electron mass appearing in QED. It is a necessary parameter associated with
the renormalisation scheme. $n_f$ is the effective number of quark flavours. For very
large $Q^2 >$ (mass of the top quark)$^2$, $n_f = 6$, but $n_f$ is smaller at smaller $Q^2$. The
important point to note is that $(11 - (2/3)n_f)$ is a positive number. Thus, in contrast
to what happens in QED, $g(Q^2)$ decreases as $Q^2$ increases, and this is the basis of
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quark 



Figure 16.2 There are Feynman graphs similar to those of Fig. 16.1 but for gluon 
exchange between quarks and antiquarks. An additional lowest order contribution 
to vacuum polarisation is associated with this Feynman graph coming from the 
gluon self-coupling. 


asymptotic freedom. As with QED the fermions contribute with a negative sign,
but their contribution is outweighed by the virtual gluons that contribute the
number 11. The difference is due to the presence of gluon loops in QCD (Fig. 16.2).
This property of QCD was discovered by Gross and Wilczek (1973) and Politzer
(1973).

Although renormalisation seems to necessitate the introduction of a second,
dimensioned, parameter $\lambda$, the effective coupling constant is in fact dependent on
only one parameter. We can set

$$\frac{16\pi^2}{g^2} - [11 - (2/3)n_f]\ln\lambda^2 = -[11 - (2/3)n_f]\ln\Lambda^2, \tag{16.24}$$
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thus defining A, and then 


$$\alpha_s(Q^2) = \frac{g^2(Q^2)}{4\pi} = \frac{4\pi}{[11 - (2/3)n_f]\ln(Q^2/\Lambda^2)}. \tag{16.25}$$

This remarkable feature survives in all orders of perturbation theory. Higher terms
in the expansion of $\alpha_s(Q^2)$ are given in, for example, Particle Data Group (2005).

$\Lambda$ is well defined in the limit of large $Q^2$, and it is standard practice to regard
the one parameter $\Lambda$, rather than the two parameters $g$ and $\lambda$, as the fundamental
constant of QCD, which must be determined from experiment. It is also interesting
to note that we have replaced a dimensionless parameter $g$ by a dimensioned one,
$\Lambda$. Asymptotic freedom is displayed since $\alpha_s(Q^2) \to 0$ as $Q^2 \to \infty$. It is clear
from (16.25) that perturbation theory breaks down at $Q^2 = \Lambda^2$, when the effective
coupling constant becomes infinite. Small values of $Q^2$ are associated with large
distances, and the length scale $\Lambda^{-1}$ is called the confinement length.
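
The trade of the pair $(g, \lambda)$ for the single scale $\Lambda$ can be checked numerically. The sketch below assumes the lowest-order forms $\alpha_s = (g^2/4\pi)/\{1 + (g^2/16\pi^2)[11 - (2/3)n_f]\ln(Q^2/\lambda^2)\}$ and $\alpha_s = 4\pi/\{[11 - (2/3)n_f]\ln(Q^2/\Lambda^2)\}$, related by $\Lambda = \lambda\exp\{-8\pi^2/([11 - (2/3)n_f]g^2)\}$. The numerical values of $g$, $\lambda$ and $n_f$ are arbitrary illustrations, not fitted quantities.

```python
import math


def b0(n_f):
    """The coefficient 11 - (2/3) n_f."""
    return 11.0 - 2.0 * n_f / 3.0


def alpha_s_two_parameter(q, g, lam, n_f):
    """Lowest-order alpha_s in terms of the pair (g, lambda)."""
    return (g * g / (4.0 * math.pi)) / (
        1.0 + g * g / (16.0 * math.pi**2) * b0(n_f) * math.log(q * q / (lam * lam)))


def big_lambda(g, lam, n_f):
    """The single scale Lambda equivalent to the pair (g, lambda)."""
    return lam * math.exp(-8.0 * math.pi**2 / (b0(n_f) * g * g))


def alpha_s(q, lam_qcd, n_f):
    """Lowest-order alpha_s in terms of Lambda alone; needs q > Lambda."""
    return 4.0 * math.pi / (b0(n_f) * math.log(q * q / (lam_qcd * lam_qcd)))


g_example, lam_example, nf_example = 1.2, 91.19, 5       # illustrative values
lam_qcd = big_lambda(g_example, lam_example, nf_example)

for q in (10.0, 100.0, 1000.0):
    print(q,
          alpha_s_two_parameter(q, g_example, lam_example, nf_example),
          alpha_s(q, lam_qcd, nf_example))
```

The two columns agree at every $Q$, and both fall as $Q$ grows. Changing $g$ and $\lambda$ together so that `big_lambda` is unchanged leaves the coupling untouched, which is the content of the statement that only one parameter is physical.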


16.4 The quark-antiquark interaction at short distances 


In QED, single photon exchange between an electron and a positron gives the
Coulomb potential

$$V(r) = \frac{1}{(2\pi)^3}\int\tilde{V}(Q^2)\,e^{-i\mathbf{Q}\cdot\mathbf{r}}\,d^3\mathbf{Q} = -\frac{e^2}{4\pi r} = -\frac{\alpha}{r},$$

where $\tilde{V}(Q^2) = -e^2/Q^2$ and $\alpha$ is the fine-structure constant. In QCD
perturbation theory, single photon exchange is replaced by the sum of eight single gluon
exchanges. To lowest order, the Coulomb-like potential between a quark and an
antiquark in a colour singlet state and at a distance $r$ apart may be shown to be (see
Leader and Predazzi, 1982, p. 175)

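$$V_{\text{QCD}}(r) = -\frac{g^2}{4\pi r}\,\frac{1}{3}\sum_{a=1}^{8}\text{Tr}\left(\frac{\lambda_a}{2}\frac{\lambda_a}{2}\right) = -\frac{g^2}{4\pi r}\,\frac{1}{12}\sum_{a=1}^{8}\text{Tr}(\lambda_a\lambda_a) = -\frac{4}{3}\,\frac{g^2}{4\pi r}. \tag{16.26}$$

The factor (1/3) is from the normalisation of the colour singlet state (see (16.19)).
With quantum corrections, the effective potential at short distances becomes

$$V_{\text{QCD}} = -\frac{4}{3}\,\frac{\alpha_s(r)}{r}$$

where

$$\frac{\alpha_s(r)}{r} = \frac{4\pi}{(2\pi)^3}\int\frac{\alpha_s(Q^2)}{Q^2}\,e^{-i\mathbf{Q}\cdot\mathbf{r}}\,d^3\mathbf{Q}. \tag{16.27}$$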



This is a significant result for the charmonium $c\bar{c}$ and bottomonium $b\bar{b}$ systems,
in which the heavy quark and antiquark are slowly moving. In these systems the
colour Coulomb energy is the main contribution to the potential energy: colour
magnetic effects are of relative order $v/c$. The behaviour of $\alpha_s(Q^2)$ at large $Q^2$
gives the dominant contribution to $V_{\text{QCD}}(r)$ at small $r$ (Problem 16.5). We shall
return to charmonium and bottomonium in Chapter 17.


16.5 The conservation of quarks 

In addition to the SU(3) local colour symmetry, the Lagrangian density (16.11) has
six global U(1) symmetries:

$$q_f \to q'_f = \exp(i\alpha_f)\,q_f. \tag{16.28}$$

In the Standard Model these remain global and are not elevated into local gauge
symmetries. They imply conservation of quark number for each flavour of quark.
Thus the strong interaction does not change quark flavour. Regarding mesons and
baryons, the $K^+$, for example, which can be denoted $K(u\bar{s})$, has u quark number
1 and s quark number −1; the proton P(uud) has u quark number 2 and d quark
number 1. Only the weak interaction, as exemplified in weak decays, can change
quark flavour. Including the weak interaction, and in particular that part involving
the Kobayashi-Maskawa mixing matrix, the six U(1) symmetries reduce to one.
Individual quark flavour numbers are not conserved, and only the overall quark
number remains constant.


16.6 Isospin symmetry 

The estimated masses of the u quark (1.5 MeV < $m_u$ < 4 MeV) and d quark
(4 MeV < $m_d$ < 8 MeV) are small compared with those of the s quark (100 MeV <
$m_s$ < 300 MeV) and the heavy c, b and t quarks. The masses of the u and d quarks
are also small compared with those of the lightest hadrons: the $\pi^0$ has a mass
~ 135 MeV and the proton has a mass ~ 938 MeV. At low energies we may
therefore neglect all but the u and d quarks, and consider the Lagrangian density to be,
as a first approximation,

$$\mathcal{L}_{ud} = \bar{u}\,i\gamma^\mu(\partial_\mu + igG_\mu)u + \bar{d}\,i\gamma^\mu(\partial_\mu + igG_\mu)d - m_u\bar{u}u - m_d\bar{d}d \tag{16.29}$$

where here $G_\mu$ is the gluon field matrix, evaluated from the field equations (16.13)
with all but the u and d quark fields neglected. The fields u and d in (16.29)
are triplets of Dirac fermion fields; colour indices and Dirac indices have been
suppressed.
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We now combine the u and d fields into an isospin doublet, 


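$$D(x) = \begin{pmatrix} u(x) \\ d(x) \end{pmatrix} \tag{16.30}$$

and we can write

$$\mathcal{L}_{ud} = \bar{D}\,i\gamma^\mu(\partial_\mu + igG_\mu)D - (1/2)(m_u + m_d)\bar{D}D - (1/2)(m_u - m_d)\bar{D}\tau^3D \tag{16.31}$$

where

$$\tau^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

and $\bar{D} = (u^\dagger\gamma^0, d^\dagger\gamma^0)$.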


$\mathcal{L}_{ud}$ is invariant under a global U(1) transformation

$$D \to D' = \exp(-i\alpha^0)D, \tag{16.32}$$

which leads (cf. Section 4.1) to the conserved quark current

$$j^\mu = \bar{D}\gamma^\mu D = \bar{u}\gamma^\mu u + \bar{d}\gamma^\mu d. \tag{16.33}$$

It is also invariant under a global U(1) transformation

$$D \to D' = \exp(-i\alpha^3\tau^3)D \tag{16.34}$$

which leads to the conserved current

$$j_3^\mu = \bar{D}\gamma^\mu\tau^3D = \bar{u}\gamma^\mu u - \bar{d}\gamma^\mu d. \tag{16.35}$$

(16.33) and (16.35) show that this Lagrangian density (16.31) conserves both u and 
d quark numbers separately. 
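
The equivalence of the mass terms in the doublet form and in the original u, d form is a short exercise. As a sketch, using only $\bar{D}D = \bar{u}u + \bar{d}d$ and $\bar{D}\tau^3D = \bar{u}u - \bar{d}d$ (with $\tau^3$ the diagonal matrix with entries 1, −1):

```latex
\begin{aligned}
\tfrac{1}{2}(m_u + m_d)\,\bar{D}D + \tfrac{1}{2}(m_u - m_d)\,\bar{D}\tau^3 D
 &= \tfrac{1}{2}(m_u + m_d)(\bar{u}u + \bar{d}d)
  + \tfrac{1}{2}(m_u - m_d)(\bar{u}u - \bar{d}d) \\
 &= m_u\,\bar{u}u + m_d\,\bar{d}d .
\end{aligned}
```

The first term is proportional to the unit matrix in the doublet space, while the second singles out the $\tau^3$ direction; it is only this second term, proportional to $m_u - m_d$, that distinguishes u from d.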

So-called isospin symmetry appears if we neglect the mass difference $(m_u - m_d)$.
The resulting, simplified, Lagrangian density is invariant under the global SU(2)
transformation

$$D \to D' = \exp(-i\alpha^k\tau^k)D \tag{16.36}$$

where the $\tau^k$ are the generators of the group SU(2) (Appendix B, Section B.3).
In addition to the conserved current (16.35) we now have also the conserved
currents

$$j_1^\mu = \bar{D}\gamma^\mu\tau^1D, \qquad j_2^\mu = \bar{D}\gamma^\mu\tau^2D \tag{16.37}$$

and the corresponding time-independent quantities

$$\int D^\dagger\tau^kD\,d^3x, \qquad k = 1, 2, 3. \tag{16.38}$$
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SU(2) transformations are equivalent to rotations in a three-dimensional ‘isospin 
space’. In analogy with the intrinsic angular momentum operator $\mathbf{S} = (1/2)\boldsymbol{\sigma}$, we
define the isospin operator $\mathbf{I} = (1/2)\boldsymbol{\tau}$; then

$$\mathbf{I}^2 = I_1^2 + I_2^2 + I_3^2 = \frac{3}{4}\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad I_3 = \frac{1}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

A u quark state is an eigenstate of $\mathbf{I}^2$ and $I_3$ with $I = 1/2$, $I_3 = 1/2$, and a d quark
state is an eigenstate with $I = 1/2$, $I_3 = -1/2$. The mathematics of isospin is
identical to the mathematics of angular momentum, and the formalism of isospin is
very useful in understanding and classifying hadron states, as indicated in Chapter
1. We see here its origin in QCD, with the neglect of the u − d mass difference and
the electromagnetic and weak interactions.
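
The isospin algebra for the doublet can be verified by direct matrix multiplication. The sketch below assumes that the $\tau^k$ are the Pauli matrices in their standard representation (the text defers their explicit form to Appendix B) and checks that $\mathbf{I}^2 = 3/4$ times the unit matrix and that $(\tau^1 + i\tau^2)/2$ is the operator that turns a d state into a u state. Helper names are illustrative.

```python
# Pauli matrices tau^1, tau^2, tau^3 (standard representation, assumed).
TAU = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def scale(m, x):
    return [[x * m[i][j] for j in range(2)] for i in range(2)]


def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


ISO = [scale(t, 0.5) for t in TAU]     # I_k = tau^k / 2

I_SQUARED = [[0, 0], [0, 0]]
for component in ISO:
    I_SQUARED = add(I_SQUARED, matmul(component, component))

RAISING = add(ISO[0], scale(ISO[1], 1j))   # (tau^1 + i tau^2) / 2

u_state, d_state = [1, 0], [0, 1]
raised_d = [sum(RAISING[i][j] * d_state[j] for j in range(2)) for i in range(2)]

print(I_SQUARED)    # 3/4 times the unit matrix, i.e. I(I + 1) with I = 1/2
print(raised_d)     # equals the u state
```

The eigenvalue 3/4 is $I(I + 1)$ with $I = 1/2$, exactly as for spin, and the raising combination is the same matrix that appears when charged currents are written in terms of the doublet.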


16.7 Chiral symmetry 

If we neglect entirely the quark masses, further approximate symmetries arise. These
are of interest in particle physics. The Lagrangian density (16.31) may be written
in terms of the left-handed and right-handed isospin doublets $L = (1/2)(1 - \gamma^5)D$
and $R = (1/2)(1 + \gamma^5)D$. Neglecting the mass terms it becomes

$$\mathcal{L} = L^\dagger i\tilde{\sigma}^\mu(\partial_\mu + igG_\mu)L + R^\dagger i\sigma^\mu(\partial_\mu + igG_\mu)R. \tag{16.39}$$

L and R are now doublets of two-component spinors, and there are eight conserved
currents:

$$L^\dagger\tilde{\sigma}^\mu L, \quad L^\dagger\tilde{\sigma}^\mu\tau^kL, \quad R^\dagger\sigma^\mu R, \quad R^\dagger\sigma^\mu\tau^kR, \quad k = 1, 2, 3.$$

An important observation is that the currents $L^\dagger\tilde{\sigma}^\mu\tau^1L$ and $L^\dagger\tilde{\sigma}^\mu\tau^2L$ couple to
the $W^\pm$ boson fields in the Lagrangian density (14.15), and appear in the effective
Lagrangian density (14.22). The relevant quark factor in (14.15) is $u_L^\dagger\tilde{\sigma}^\mu d_LV_{ud}$, and
we may write

$$u_L^\dagger\tilde{\sigma}^\mu d_L = L^\dagger\tilde{\sigma}^\mu(1/2)(\tau^1 + i\tau^2)L,$$
$$d_L^\dagger\tilde{\sigma}^\mu u_L = L^\dagger\tilde{\sigma}^\mu(1/2)(\tau^1 - i\tau^2)L. \tag{16.40}$$

This observation gives insight into the nature of the effective Lagrangian for $\beta$
decay, as we shall see in Chapter 18.

The independent symmetry transformations

$$L \to L' = \exp[-i(\alpha^0 + \alpha^k\tau^k)]L, \qquad R \to R$$

and

$$R \to R' = \exp[-i(\beta^0 + \beta^k\tau^k)]R, \qquad L \to L$$

may be written in terms of Dirac spinors as

$$D \to D' = \exp[-i(\alpha^0 + \alpha^k\tau^k)(1/2)(1 - \gamma^5)]D, \tag{16.41}$$

$$D \to D' = \exp[-i(\beta^0 + \beta^k\tau^k)(1/2)(1 + \gamma^5)]D, \tag{16.42}$$

respectively.

The eight independent symmetry operations can also be taken as

$$D \to D' = \exp[-i(\alpha'^0 + \alpha'^k\tau^k)]D \tag{16.43}$$

which give conservation of quark number and isospin, and

$$D \to D' = \exp[-i(\beta'^0 + \beta'^k\tau^k)\gamma^5]D. \tag{16.44}$$

The last four are known as the chiral symmetries.

Problems 

16.1 Show that

$$G^a_{\mu\nu} = (\partial_\mu G^a_\nu - \partial_\nu G^a_\mu) - g\sum_{b,c}f_{abc}\,G^b_\mu G^c_\nu.$$

16.2 Using Problem 16.1, show that the gluon self-coupling terms in the Lagrangian
density (16.9) are

$$\mathcal{L}_{\text{int}} = g\,f_{abc}(\partial_\mu G^a_\nu)G^{b\mu}G^{c\nu} - (g^2/4)\,f_{abc}f_{ade}\,G^b_\mu G^c_\nu G^{d\mu}G^{e\nu}.$$

16.3 Verify the expression (16.14) for the current $j^{a\nu}$.

16.4 Estimate the value of $Q$ for which $\tilde{V}(Q^2)$ of equation (16.21) becomes infinite.

16.5 From (16.27) show that

$$\alpha_s(r) = \frac{2}{\pi}\int_0^\infty\alpha_s(x^2/r^2)\,\frac{\sin x}{x}\,dx.$$

(Note that the expression (16.25) for $\alpha_s(x^2/r^2)$ is only valid for $x > \Lambda r$, but for
small $r$ this range may be anticipated to give the main contribution to the integral.)
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Calculations in QCD have been made in two ways: lattice simulations at low ener¬ 
gies, and perturbative calculations at high energies. In this chapter we outline some 
of the results obtained. 


17.1 Lattice QCD and confinement 

It was pointed out in Section 16.1 that, at low energies, a non-perturbative approach
to QCD is needed. ‘Lattice QCD’ is such an approach. The gluon fields are defined
on a four-dimensional lattice of points $(n^0, \mathbf{n})a$, where $a$ is the lattice spacing and
the $n^\mu$ are integers. Field derivatives are replaced by discrete differences. This gives
a ‘lattice regularised’ QCD. The lattice spacing corresponds to an ultraviolet cut-off,
since wavelengths $< 2a$ cannot be described on the lattice. A lattice does not have
full rotational symmetry in space, but it is believed that nevertheless continuum
QCD corresponds to the limit $a \to 0$. Current computing power allows lattices of
$\sim(36)^4$ points. The range of the strong nuclear force is ~ 1 fm. To fit such a distance
comfortably on the lattice, we can anticipate that we shall not want $a$ to be much
less than (2 fm)/36 = 0.056 fm (and $\hbar c/a > 3.5$ GeV).

In the high energy perturbation theory described in Section 16.3, the
renormalisation parameter $\lambda$ and the dimensionless coupling parameter $g$ are combined to
give a single physical parameter, $\Lambda$, having the dimensions of energy. The
relationship between the effective coupling constant $\alpha_s(Q^2)$ and $\Lambda$ in the lowest order
of perturbation theory is given by (16.25). In lattice QCD, the unphysical lattice
parameter $a$ and the dimensionless coupling parameter $g(a)$ combine to give a
single physical parameter $\Lambda_{\text{latt}}$, having the dimensions of energy. In the lowest order
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of ‘lattice’ perturbation theory, as a —> 0 then g(a) —0, 

$$g^2(a) = -\frac{16\pi^2}{11\ln(a^2\Lambda^2_{\text{latt}})} \tag{17.1}$$


(see Hasenfratz and Hasenfratz, 1985). 

$\Lambda_{\text{latt}}$ is independent of $a$ in the limit $a \to 0$. This remarkable feature of the theory
is called dimensional transmutation.

Equation (17.1) may be compared with (16.25) with $n_f$ set equal to zero. It can
be shown theoretically (Dashen and Gross, 1981) that

$$\frac{\Lambda_{\text{latt}}}{\Lambda} = \text{constant} \approx \frac{1}{30}. \tag{17.2}$$

The precise value of the constant depends on the renormalisation scheme in which
$\Lambda$ is defined, and the number of quark flavours included. $\Lambda_{\text{latt}}$, or equivalently $\Lambda$,
is to be determined from experiment. We shall see in Section 17.3 that $\Lambda$ is known
to be ~ 300 MeV, so that $\Lambda_{\text{latt}}$ ~ 10 MeV. We can then infer from equation (17.1)
that for $a$ ~ 0.056 fm, the coupling constant $g$ should be of order 1.
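
The estimates in this section are simple enough to reproduce directly. The sketch below assumes the lowest-order relation $g^2(a) = -16\pi^2/[11\ln(a^2\Lambda^2_{\text{latt}})]$ for (17.1) and the conversion constant $\hbar c \approx 0.1973$ GeV fm (an input of ours, not quoted in the text); the function names are illustrative.

```python
import math

HBARC = 0.1973269804   # hbar * c in GeV fm (assumed input)


def lattice_spacing(box_fm=2.0, n_sites=36):
    """Spacing in fm when a box of the given size spans n_sites points."""
    return box_fm / n_sites


def cutoff_energy(a_fm):
    """Energy scale hbar c / a in GeV associated with spacing a."""
    return HBARC / a_fm


def lattice_coupling(a_fm, lambda_latt_gev):
    """g(a) from the lowest-order relation, valid when a * Lambda_latt << 1."""
    x = (a_fm / HBARC) * lambda_latt_gev     # dimensionless a * Lambda_latt
    g_squared = -16.0 * math.pi**2 / (11.0 * math.log(x * x))
    return math.sqrt(g_squared)


a = lattice_spacing()                   # (2 fm)/36
energy = cutoff_energy(a)
g = lattice_coupling(0.056, 0.010)      # a ~ 0.056 fm, Lambda_latt ~ 10 MeV

print(f"a = {a:.3f} fm, hbar c / a = {energy:.2f} GeV, g = {g:.2f}")
```

Because $g$ depends on $a$ only through a logarithm, halving the lattice spacing changes $g$ by only a few per cent, which is why the approach to the continuum limit is slow.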

Lattice QCD calculations have been made to compute the potential energy of 
a fixed quark and an antiquark in a colour singlet state, as a function of their 
separation distance. The form of this potential at short distances was discussed in 
Section 16.4. Non-perturbative lattice calculations have been made in the quenched 
approximation, excluding effects of virtual quark pair creation. 

In the lattice calculations, distances are measured in units of a, and energies in 
units of (1/a). A coupling constant g is chosen, and the quark and antiquark are 
localised on lattice sites that are spatially fixed at a distance apart of r = |n|a, where 
n is a set of three integers. The field energy E(r) generated by the quark-antiquark 
pair is computed for a sequence of separation distances, and is found to be of the 
form 


E(r) = 2 A + Kr - 


4 Q'iatt(r) 
3 r 


(17.3) 


where A and K are constants, and the factor (4/3) has been inserted to facilitate 
comparison with the perturbation results of Section 16.4. The constant 2A can be 
interpreted as a contribution to the rest energies of the quark and antiquark, and is 
absorbed into their notional masses to leave an effective potential energy 


$$V(r) = Kr - \frac{4}{3}\,\frac{\alpha_{\rm latt}(r)}{r} \qquad (17.4)$$


The results of such a calculation by Bali and Schilling (1993) using a $32^4$ lattice
are shown in Fig. 17.1. In this calculation g = 0.97. The term Kr dominates
at large distances. The constant K is called the string tension. In quenched QCD
on a lattice, with g fixed, there is only one energy parameter $a^{-1}$ (or $\Lambda_{\rm latt}$). Hence
[Figure 17.1 plot; horizontal axis: r in fm, from 0.2 to 1.2]


Figure 17.1 The colour singlet quark-antiquark potential as computed on a lattice. 
For a fixed value of the coupling constant g (of order 1) V(r) is computed in lattice 
units (r in units of a, V in units of 1/a). The computed points are fitted with a curve 
of the form 

$$V(r) = 2A + Kr - (c/r) + (f/r^2).$$

In this example g was fixed at 0.97. The calculation determined K = 0.0148;
K is the string tension in units of $1/a^2$. The phenomenology of $c\bar c$ and $b\bar b$ quark
systems suggests $K \sim (440\ \text{MeV})^2$. Taking this value determines a = 0.055 fm
and 1/a = 3.58 GeV. It also determines one point on the curve g(a) as a function
of a. The calculations must be repeated to compute a for several values of g to
check the extent to which the asymptotic form, like equation (17.1), is obeyed
($\Lambda_{\rm latt}$ is independent of a) in order to be confident of the continuum limit (Bali and
Schilling, 1993). 


K has the dimensions of $a^{-2}$. Bali and Schilling (1993) find $K = 0.01475(29)\,a^{-2}$.
In Chapter 1, Fig. 1.5 shows the experimental spectra of the heavy quark systems
charmonium ($c\bar c$) and bottomonium ($b\bar b$). Many fits to these spectra have been
made using a Schrödinger equation with an interaction potential of the form (17.3).
In the lowest energy states of heavy quark systems, the quark and antiquark are
slowly moving, so that a non-relativistic approximation is reasonable. The spectra
are well fitted with $K \approx (440\ \text{MeV})^2 \approx 1\ \text{GeV fm}^{-1}$, $\alpha(r) = \text{constant} = 0.39$.
Taking $K = (440\ \text{MeV})^2$ fixes the lattice spacing a = 0.0544 fm, and $a^{-1} =$
3.62 GeV.

Equation (17.1) could now be used to estimate $\Lambda_{\rm latt}$. However, this equation (and
more sophisticated extensions to higher orders of lattice perturbation theory) holds
only in the limit $a \to 0$. To extract $\Lambda_{\rm latt}$ reliably, the calculations must be repeated
for different values of g. The corresponding values of a follow from the string
tension. The limit of $\Lambda_{\rm latt}$ as $a \to 0$ may then be estimated. Bali and Schilling (1993)
found $\sqrt{K}/\Lambda_{\rm latt} = 51.9$ (with asymmetric errors), which is consistent with the value
$\sqrt{K}/\Lambda_{\rm latt} = 49.6(3.8)$ estimated by Booth et al. (1992) from results on a $36^4$ lattice.
Taking $\sqrt{K} = 440$ MeV gives $\Lambda_{\rm latt} \approx 8.5$ MeV, and from (17.2) $\Lambda \approx 255$ MeV.
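
The chain of conversions in the last few paragraphs is simple arithmetic and can be sketched as follows. The inputs are the numbers quoted in the text (K = 0.01475 in lattice units, a physical string tension of (440 MeV)^2, the ratio 51.9 and the factor 30 from (17.2)); the value of $\hbar c$ used to convert between MeV and fm is an assumption added here.

```python
import math

HBARC_MEV_FM = 197.327        # hbar*c in MeV fm (assumed conversion constant)

sqrt_K_lattice = math.sqrt(0.01475)  # sqrt(K) in units of 1/a (Bali and Schilling value)
sqrt_K_mev = 440.0                   # sqrt(K) in MeV from heavy quark phenomenology

# Matching the two expressions for sqrt(K) fixes the lattice spacing.
inv_a_mev = sqrt_K_mev / sqrt_K_lattice
a_fm = HBARC_MEV_FM / inv_a_mev

# Lattice scale from the quoted ratio sqrt(K)/Lambda_latt = 51.9,
# then the continuum scale from equation (17.2).
lambda_latt_mev = sqrt_K_mev / 51.9
lambda_mev = 30.0 * lambda_latt_mev

print(f"1/a = {inv_a_mev / 1000:.2f} GeV, a = {a_fm:.4f} fm")
print(f"Lambda_latt = {lambda_latt_mev:.1f} MeV, Lambda = {lambda_mev:.0f} MeV")
```

The results agree with the quoted values to within rounding in the last digit.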

At small r the attractive Coulomb-like term dominates. It is found that $\alpha_{\rm latt}(r)$ is
a slowly varying function of r that decreases with decreasing r, as expected from
perturbation theory (Section 16.3). The potential of Fig. 17.1 is well fitted with

$$\alpha_{\rm latt}(r) = 0.236 - (0.0031\ \text{fm})/r.$$

This is to be compared with the value of $\alpha = e^2/4\pi \approx 1/137$ of QED.
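
To get a feel for the relative size of the two terms, the potential (17.4) can be tabulated with the fitted coupling above and $K = (440\ \text{MeV})^2$. This is only a sketch: the factor $\hbar c$ is inserted by hand to convert to MeV and fm, and the sample separations are arbitrary choices.

```python
HBARC_MEV_FM = 197.327  # hbar*c in MeV fm (assumed conversion constant)
SQRT_K_MEV = 440.0      # square root of the string tension, as quoted in the text

# String tension converted from MeV^2 to MeV per fm.
K_MEV_PER_FM = SQRT_K_MEV ** 2 / HBARC_MEV_FM

def alpha_latt(r_fm):
    """Fitted running coupling quoted for the potential of Fig. 17.1."""
    return 0.236 - 0.0031 / r_fm

def potential_mev(r_fm):
    """V(r) = K r - (4/3) alpha_latt(r) / r, in MeV with r in fm."""
    coulomb = (4.0 / 3.0) * alpha_latt(r_fm) * HBARC_MEV_FM / r_fm
    return K_MEV_PER_FM * r_fm - coulomb

for r in (0.1, 0.2, 0.5, 1.0):
    print(f"r = {r:.1f} fm  alpha = {alpha_latt(r):.3f}  V = {potential_mev(r):8.1f} MeV")
```

The Coulomb-like term wins at 0.1 fm, where V is negative, while by 1 fm the linear term dominates.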

It is interesting to note that the linearly rising term in the potential is computed 
in the quenched approximation. If quantum fluctuating quark fields were to be 
included, the large potential energy available at large separation distances of the 
fixed quark and antiquark pair would produce pairs of quarks and antiquarks. A 
quark would migrate to the neighbourhood of the fixed antiquark to form a colour 
singlet, and an antiquark would similarly form another singlet with the fixed quark, 
resulting in two well separated mesons. 


17.2 Lattice QCD and hadrons 

Systems of quarks and antiquarks held together by the associated gluon field are 
called hadrons (see Section 1.4). For example, the proton, the only stable hadron, 
has up quark number two and down quark number one. Other systems, for example 
mesons, are held together only transiently by their gluon field. As well as these 
so-called valence quarks that define a system, a hadron contains quark-antiquark 
pairs excited by the gluon field, and known as sea quarks. 

So far, in our discussion of hadrons and confinement, sea quarks have been
neglected. Convincing calculations of hadron properties require their inclusion,
especially $u\bar u$, $d\bar d$ and $s\bar s$ pairs, which because of their small masses with respect
to $\Lambda_{\rm QCD}$ are readily excited by the gluon field. Since the first edition of this book,
much progress in lattice QCD has been made to include these pairs.

Quarks on the lattice require the introduction of quark masses. In the work of
Davies et al. (2004) calculations are made with $m_u = m_d$ (the isospin symmetry
limit: see Section 16.6). A mean mass $(m_u + m_d)/2$ is introduced along with the
masses $m_s$, $m_c$, $m_b$, and the strong coupling constant g: five parameters in all. With a
fixed value of g the lattice spacing a and the four quark masses are determined by
fitting the five experimentally determined masses $m(b\bar b\,1s) = 9.460$ GeV, $m(b\bar b\,2s) =$
10.023 GeV (see Figure 1.5), $m_\pi = 0.139$ GeV, $m_K = 0.496$ GeV and $m_D =$
1.867 GeV. The $D^+$ meson $D(c\bar s)$ is the ground state of the $c\bar s$ valence quark
system.

As in Section 17.1 the lattice spacing a is a function of g and so also are the quark
masses. The calculations have to be repeated for different values of g to extract $\Lambda_{\rm latt}$
and g(a) and the four quark masses, which are also taken to be functions of a. They
can also be regarded as functions of energy, $\hbar c/a$. The fact that the strong coupling
constant and quark masses are functions of the energy at which they are measured
is a natural feature of QCD. The calculations give, at an energy of 2 GeV for the
light quarks,



$$m_s(2\ \text{GeV}) = 87 \pm 8\ \text{MeV}$$

$$m_c = 1.1 \pm 0.1\ \text{GeV}$$
$$m_b = 4.25 \pm 0.15\ \text{GeV}$$

and

$$\alpha_s(M_Z) = 0.121 \pm 0.003.$$


$m_c$ and $m_b$ are quoted at their own mass scale and it is conventional to quote $\alpha_s$
at the scale of the Z boson. The energy dependence needed to find the parameters
at different scales is given by equations like (16.25).

Having values for the parameters of QCD, its validity can be tested by confronting
independent experimental data with calculations. At present one is confined to
single hadrons that are stable to the strong interaction. Unstable particles or those
that are close to instability tend to fluctuate outside the lattice boundaries. Also the
baryons, and in particular the proton and neutron that carry u and d valence quarks,
cannot yet be reliably handled on the lattice. Nevertheless many particle properties
lend themselves to lattice calculations and the success in fitting data is impressive.
Figure 17.2 shows results taken from Davies et al. (2004). Ten calculations are
compared with experiment. The results are expressed as the calculated divided by
the experimental value. The experimental values are accurately known and the errors
that bracket the mean values indicate the estimated accuracy of the calculation. It
seems that with present computing power, theory and experiment agree to better

Figure 17.2 Quantities calculated in lattice QCD divided by their experimental
values:

$f_\pi = \alpha_\pi/(\sqrt{2}\,G_F V_{ud})$, see Section 9.2;

$f_K = \alpha_K/(\sqrt{2}\,G_F V_{us})$, see Problem 9.10.

$m_\Omega$ is the mass of the $\Omega(sss)$, the ground state of the baryon with s quark number
three.

$3m_\Xi - m_N$ is a combination of ground state baryon masses $\Xi(ssu)$ and the
neutron N(ddu).

The other mass differences are between states of the $c\bar c$ and $b\bar b$ mesons (Davies
et al., 2004).

than 4%. There is no reason here to doubt the validity of QCD as the theory of 
strong interactions. 

17.3 Perturbative QCD and deep inelastic scattering 

One of the first applications of perturbative QCD was to the $Q^2$ dependence of
the parton distribution functions of the proton. In the parton model of inelastic

Figure 17.3 The proton structure function $F_2(x, Q^2)$. The experimental points are
fitted with curves generated by the evolution equations with $\Lambda = 205$ MeV. To
aid reading in the left-hand section, the data have been scaled by the given factors,
so for example at x = 0.18 the graph is of $2F_2(0.18, Q^2)$. (Taken from Physics
Letters B223, Benvenuti, A. C. et al. Test of QCD and a measurement of $\Lambda$ from
scaling violations in the proton structure factor $F_2(x, Q^2)$ at high $Q^2$ (Benvenuti
et al., p. 490), with kind permission of Elsevier Science-NL, Sara Burgerhartstraat
25, 1055 KV Amsterdam, The Netherlands.)


electron-proton scattering (Appendix D), the proton is described by parton
distribution functions $f_i(x, Q^2)$, where

$$Q^2 = -q_\mu q^\mu = (\mathbf{p} - \mathbf{p}')^2 - (E - E')^2,$$

$q^\mu = (E - E', \mathbf{p} - \mathbf{p}')$ is the energy and momentum transferred in the inelastic
electron scattering, and $x = Q^2/[2M(E - E')]$ where M is the proton mass. The
partons are identified as quarks, antiquarks and gluons. Typically, at a fixed value
of $Q^2$, say $Q_0^2$, distribution functions $f_i(x, Q_0^2)$ are extracted from the data, the
number of distribution functions being determined by the number of distinct data
sets. At this stage the extraction of the distribution functions is merely a matter of
curve fitting: although the functions $f_i(x, Q_0^2)$ should be a consequence of QCD, the
problem of establishing their form theoretically is immensely difficult. However,
given these distribution functions, and provided $Q_0^2$ is large enough, perturbative
QCD can be used to predict how they evolve with changing $Q^2$. This evolution

Figure 17.4 $e^+e^-$ annihilation to a quark-antiquark pair with no gluon radiative
corrections.


is described by the equations of Altarelli and Parisi (1977), which take account 
perturbatively of the quark-gluon interactions. 

As an example, Fig. 17.3 shows experimental data on the related structure
function $F_2(x, Q^2)$ defined in Appendix D, taken by the BCDMS collaboration
(Benvenuti et al., 1989). Also shown are the theoretical predictions, at fixed values
of x, of the QCD evolution as a function of $Q^2$. The data are precise and the shapes
of all the curves are given by the single parameter $\Lambda$. Fits to the data determine
$\Lambda = 205 \pm 80$ MeV, from which one can infer, using (16.25) with $n_f = 5$, that
$\alpha_s(M_Z^2) = 0.115 \pm 0.007$.


17.4 Perturbative QCD and $e^+e^-$ collider physics


The basic Feynman diagrams for hadron production in $e^+e^-$ colliding beam
experiments are shown in Fig. 17.4. In the range 10 GeV to 40 GeV, electromagnetic
processes dominate. The data were discussed in Section 1.7.

Around 90 GeV, close to the centre of mass energy for Z production, the weak
interaction dominates. The hadronic decays of the Z were discussed in Chapter
15, using perturbation theory. However, there are additional contributions to the
cross-section arising from gluon radiation, for example the processes illustrated in
Fig. 17.5.

The modification is simply expressed (see Particle Data Group, 1996). If the
hadron production cross-section without gluon radiative corrections is denoted by
$\sigma_0$ then (to order $\alpha_s^2$) the cross-section $\sigma$ with corrections is

$$\sigma = f\,\sigma_0, \qquad (17.5)$$

with f a correction factor expressed as a series in powers of $\alpha_s(Q^2)$,

Figure 17.5 The lowest order gluon radiative corrections to quark-antiquark pair
production by $e^+e^-$ annihilation.


and $\alpha_s(Q^2)$ taken at $Q^2$ equal to the square of the centre of mass energy. For example,
taking $\alpha_s(M_Z^2) = 0.115 \pm 0.007$ from Section 17.3 gives $f = 1.038 \pm 0.003$. This
is the value of f used in Chapter 15. Alternatively, the best fit to the hadronic
decays of the Z would suggest $f = 1.041 \pm 0.003$, which gives $\alpha_s(M_Z^2) = 0.123 \pm$
0.007 and $\Lambda = 310 \pm 90$ MeV. The consistency of the theory between the two
very different experimental regimes from which these estimates are obtained,
electron-proton scattering and Z decays, is impressive.
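
The two quoted values of f can be cross-checked numerically. The sketch below assumes the commonly quoted second-order expansion $f = 1 + \alpha_s/\pi + 1.411(\alpha_s/\pi)^2$ for five quark flavours; the coefficient 1.411 is an assumption brought in for illustration rather than a value taken from the text above, and the function name is hypothetical.

```python
import math

SECOND_ORDER_COEFF = 1.411  # assumed five-flavour coefficient of (alpha_s/pi)^2

def gluon_correction_factor(alpha_s):
    """Assumed O(alpha_s^2) correction factor f multiplying sigma_0."""
    x = alpha_s / math.pi
    return 1.0 + x + SECOND_ORDER_COEFF * x ** 2

f_dis = gluon_correction_factor(0.115)  # alpha_s from deep inelastic scattering
f_z = gluon_correction_factor(0.123)    # alpha_s from hadronic Z decays

print(f"f(0.115) = {f_dis:.4f}")
print(f"f(0.123) = {f_z:.4f}")
```

Under this assumption the two couplings reproduce the quoted f values to within rounding, and the shift of 0.007 in the coupling moves f by roughly 0.003, matching the quoted uncertainty.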

Figure 17.6 A three-jet event recorded by the JADE detector at the PETRA $e^+e^-$
collider, DESY.


The hadrons produced in most $e^+e^-$ annihilations at high energies appear in
two back-to-back jets associated with the originating $q\bar q$ pair. Gluon radiation
contributing to the f factor is mostly confined to be within the associated quark or
antiquark jet. However, according to perturbative QCD it is also possible for a gluon
to be radiated into a distinct region of phase space and appear as a third distinct jet.
Figure 17.6 is an example of such a three-jet event. Measurements of these three-
and even four-jet events give further strong support to the theory of QCD.

The Kobayashi-Maskawa matrix


In Chapter 14, in the theory of the weak interaction of quarks, there appeared the 
Kobayashi-Maskawa matrix: 



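$$\begin{pmatrix} V_{ud} & V_{us} & V_{ub} \\ V_{cd} & V_{cs} & V_{cb} \\ V_{td} & V_{ts} & V_{tb} \end{pmatrix} \qquad (18.1)$$

and its parameterisation:

$$\begin{pmatrix}
c_{12}c_{13} & s_{12}c_{13} & s_{13}e^{-i\delta} \\
-s_{12}c_{23} - c_{12}s_{23}s_{13}e^{i\delta} & c_{12}c_{23} - s_{12}s_{23}s_{13}e^{i\delta} & s_{23}c_{13} \\
s_{12}s_{23} - c_{12}c_{23}s_{13}e^{i\delta} & -c_{12}s_{23} - s_{12}c_{23}s_{13}e^{i\delta} & c_{23}c_{13}
\end{pmatrix} \qquad (18.2)$$

where $c_{12} = \cos\theta_{12} > 0$, $s_{12} = \sin\theta_{12} > 0$, etc. The KM matrix couples quark
fields of different flavours. It contains four physically significant parameters, which
can be taken to be the three rotation angles $\theta_{12}$, $\theta_{13}$, $\theta_{23}$, each lying in the first
quadrant, and the phase angle $\delta$.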

There is no theory relating these parameters, just as there is no theory relating 
quark masses. Indeed, the quark sector of the Standard Model may appear to the 
reader to be lacking in aesthetic appeal. The parameters of the KM matrix must 
be determined from experiment, and in this chapter we indicate how experimental 
information has been obtained. 

18.1 Leptonic weak decays of hadrons 

We have seen in Section 15.3 two unitarity sum rules that support the validity of the
Standard Model, and there are many independent measurements that both test for
consistency and, given consistency, determine the parameters. So far no definitive
inconsistencies have been established, and a large body of data is well described

Figure 18.1 (a) A Feynman diagram for the leptonic decay $b \to c + e^- + \bar\nu_e$.
(b) A quark model diagram for the decay B $\to$ charmed hadron system + $e^- + \bar\nu_e$.


with the parameter values $s_{12} = 0.2243 \pm 0.0016$, $s_{23} = 0.0413 \pm 0.0015$, $s_{13} =$
$0.0037 \pm 0.0005$ and $\delta = 57^\circ \pm 14^\circ$.
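
The parameterisation (18.2) can be evaluated numerically at these central values, which also gives a direct check that the resulting matrix is unitary. The following is a minimal sketch in plain Python; the helper names `km_matrix` and `unitarity_defect` are illustrative, and the experimental uncertainties are ignored.

```python
import cmath
import math

def km_matrix(s12, s23, s13, delta):
    """KM matrix in the parameterisation (18.2); delta in radians."""
    c12, c23, c13 = (math.sqrt(1.0 - s * s) for s in (s12, s23, s13))
    e = cmath.exp(1j * delta)  # e^{i delta}
    return [
        [c12 * c13, s12 * c13, s13 / e],
        [-s12 * c23 - c12 * s23 * s13 * e, c12 * c23 - s12 * s23 * s13 * e, s23 * c13],
        [s12 * s23 - c12 * c23 * s13 * e, -c12 * s23 - s12 * c23 * s13 * e, c23 * c13],
    ]

def unitarity_defect(V):
    """Largest deviation of V V^dagger from the unit matrix."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            entry = sum(V[i][k] * complex(V[j][k]).conjugate() for k in range(3))
            worst = max(worst, abs(entry - (1.0 if i == j else 0.0)))
    return worst

V = km_matrix(0.2243, 0.0413, 0.0037, math.radians(57.0))
for row in V:
    print("  ".join(f"{abs(element):.5f}" for element in row))
print("unitarity defect:", unitarity_defect(V))
```

The printed moduli reproduce the familiar hierarchy: diagonal elements close to 1, with $|V_{us}| \approx 0.224$, $|V_{cb}| \approx 0.041$ and $|V_{ub}| \approx 0.004$.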

A suitable starting point for the consideration of hadronic weak decays is
first-order perturbation theory in the effective Lagrangian density of equation (14.21),
where $j^\mu$ is given by (14.20). Leptonic decays are the simplest
for theoretical analysis because the leptonic parts of a transition matrix
element can be calculated with some confidence. If quarks were available as isolated
particles, the three rotation angles of the KM matrix could be determined by the
measurement of the decay rates of leptonic decays such as

$$b \to c + e^- + \bar\nu_e.$$

In lowest order perturbation theory (see Fig. 18.1a) the decay rate for this process
is given by


$$\frac{1}{\tau(b \to c)} = \frac{G_F^2\,m_b^5}{192\pi^3}\,|V_{cb}|^2\,f\!\left(\frac{m_c}{m_b}\right) \qquad (18.3)$$


where $f(x) = 1 - 8x^2 + 8x^6 - x^8 - 24x^4\ln(x)$ is a factor associated with the
available phase space. This programme cannot be carried out directly since the b
and c quarks are accompanied by other spectator quarks and gluons (see the quark
model diagram of Fig. 18.1b), which involve the calculation of strong interaction
matrix elements. To the extent that the hadronic matrix elements can be calculated,
a measurement of the decay rate will determine $|V_{cb}|^2$.
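
For orientation, the free-quark rate can be evaluated with the quark masses of Section 17.2 and the value of $|V_{cb}|$ quoted above. This is only a sketch of the idealised formula: it assumes the phase space factor is evaluated at $x = m_c/m_b$, uses $G_F = 1.16637 \times 10^{-5}\ \text{GeV}^{-2}$ (a value not given in this section), and ignores the strong interaction effects just described.

```python
import math

G_F = 1.16637e-5  # Fermi constant in GeV^-2 (assumed input)

def phase_space(x):
    """f(x) = 1 - 8x^2 + 8x^6 - x^8 - 24 x^4 ln(x), for 0 < x <= 1."""
    return 1.0 - 8.0 * x**2 + 8.0 * x**6 - x**8 - 24.0 * x**4 * math.log(x)

def rate_b_to_c(m_b, m_c, v_cb):
    """Free-quark rate 1/tau for b -> c e nu, in GeV (natural units)."""
    prefactor = G_F**2 * m_b**5 / (192.0 * math.pi**3)
    return prefactor * v_cb**2 * phase_space(m_c / m_b)

f_value = phase_space(1.1 / 4.25)
rate_gev = rate_b_to_c(m_b=4.25, m_c=1.1, v_cb=0.0413)

print(f"f(m_c/m_b) = {f_value:.3f}")
print(f"rate = {rate_gev:.2e} GeV")
```

The phase space factor vanishes at x = 1, as it must when no energy is available to the leptons, and reduces the rate by roughly 40 per cent for these masses.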


18.2 $|V_{ud}|$ and nuclear $\beta$ decay

Isospin symmetry (see Section 16.6) is important for the determination of the
hadronic matrix elements of all nuclear $\beta$ decays. Such decays involve the quark
current

$$j^\mu = d_L^\dagger\,\tilde\sigma^\mu u_L = \bar d\,\gamma^\mu\,(1/2)(1 - \gamma^5)\,u. \qquad (18.4)$$

Here we have expressed the current in terms of the Dirac four-component spinors
u and d, with the help of the projection operator $(1/2)(1 - \gamma^5)$ introduced in (5.32)
and noting $\bar d = d^\dagger\gamma^0$.

As in Chapter 16, we now take the u and d quarks together in an isotopic doublet: 


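$$D(x) = \begin{pmatrix} u(x) \\ d(x) \end{pmatrix}.$$

The isospin operator $(1/2)(\tau^1 - i\tau^2)$ has the property

$$\tfrac{1}{2}(\tau^1 - i\tau^2)\begin{pmatrix} u \\ d \end{pmatrix} = \begin{pmatrix} 0 \\ u \end{pmatrix},$$

so that we may write (see (16.31))

$$j^\mu = (1/4)\,\bar D(x)\gamma^\mu(1 - \gamma^5)(\tau^1 - i\tau^2)D(x) = (1/2)[v^\mu(x) - a^\mu(x)]. \qquad (18.5)$$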


We have split the current into the part $v^\mu(x)$, which transforms like a vector under
space inversion, and the part $a^\mu(x)$, which transforms like an axial vector (see
Section 5.5):

$$v^\mu(x) = (1/2)\,\bar D\gamma^\mu(\tau^1 - i\tau^2)D, \qquad (18.6)$$

$$a^\mu(x) = (1/2)\,\bar D\gamma^\mu\gamma^5(\tau^1 - i\tau^2)D. \qquad (18.7)$$

We saw in Section 16.6 that exact isospin symmetry leads to conserved currents:

$$v_i^\mu = (1/2)\,\bar D\gamma^\mu\tau^i D, \qquad (18.8)$$

so that the vector part of the decay current of the u and d quarks is a conserved 
isospin current. 

In the case of nucleons, we denote the isospin doublet of the effective Dirac
fields p(x) and n(x) of the proton and neutron by

$$D_N(x) = \begin{pmatrix} p(x) \\ n(x) \end{pmatrix}. \qquad (18.9)$$


An effective Lagrangian density that at the low energies of nuclear physics describes
the $\beta$ decay of a nucleon is

$$\mathcal{L}_{\rm eff} = -2\sqrt{2}\,G_F\,C\,[\,j_N^\mu\,j_\mu^\dagger + \text{Hermitian conjugate}\,], \qquad (18.10)$$


with 

$$j_N^\mu = \tfrac{1}{4}\,\bar D_N\gamma^\mu(1 - g_A\gamma^5)(\tau^1 - i\tau^2)D_N. \qquad (18.11)$$

Experimentally, it is found from a range of nuclear data that

$$C = 0.9713 \pm 0.0013 \quad\text{and}\quad g_A = 1.2739 \pm 0.0019.$$

(See Particle Data Group.)

The vector part of the current $j_N^\mu$ is the conserved isospin current of nuclear
physics and corresponds to the more fundamental conserved isospin current at
the quark level. Exact isospin symmetry would require that the contribution of the
conserved nucleon isospin current to the effective interaction (18.8, 18.9) be the
same as that of the quarks in (18.5, 18.6), so that we identify $C = V_{ud} = 0.9713 \pm$
0.0013.


18.3 More leptonic decays 

The most precise estimates of $|V_{us}|$ have come from observations of leptonic
K decays, for example $K^-(s\bar u) \to \pi^0((u\bar u - d\bar d)/\sqrt{2}) + e^- + \bar\nu_e$. Analyses of these
decays by lattice QCD, quark model calculations, and calculations based on chiral
symmetry (see Section 16.7) all converge on the value $|V_{us}| = 0.224 \pm 0.003$.

Estimates of $|V_{cs}|$ and $|V_{cd}|$ can be extracted from D decays, for example
$D^-(\bar c d) \to K^0(\bar s d) + e^- + \bar\nu_e$ or $D^-(\bar c d) \to \pi^0((u\bar u - d\bar d)/\sqrt{2}) + e^- + \bar\nu_e$. These
decay rates are proportional to $|V_{cs}|^2$ and $|V_{cd}|^2$ respectively.

More experimental information on $|V_{cd}|^2$ comes from the deep inelastic scattering
of neutrinos by atomic nuclei through processes such as

$$\nu_\mu + d \to \mu^- + c. \quad \text{(See Appendix D.)}$$

Atomic nuclei provide an abundant source of d quark targets. The cross-section
for producing a c quark rather than a u quark can be inferred by identifying those
c quarks that decay as $c \to d + \mu^+ + \nu_\mu$. Overall, a characteristic $\mu^+\mu^-$ pair is
produced.

The conclusions, after much work along the lines indicated, and without imposing
the unitarity condition, are

$$|V_{cd}| = 0.224 \pm 0.014, \qquad |V_{cs}| = 1.04 \pm 0.16.$$

Leptonic decays of B mesons ($b\bar u$, $b\bar d$, $\bar b u$ and $\bar b d$) provide the best data on $|V_{cb}|$
and $|V_{ub}|$. Three experimental facilities have been constructed to measure B decays:
in the USA at Cornell (Cleo) and Stanford (Babar), and in Japan (Belle). At these
‘B meson factories’ many million B mesons have been produced for analysis.

In the case of $|V_{cb}|$, the hadronic matrix elements for decays like $B^- \to D^0 +$
$e^- + \bar\nu_e$ can be calculated taking the heavy b quark in the $B^-(b\bar u)$ meson as static
in first approximation. Analysis of the data gives

$$|V_{cb}| = 0.0413 \pm 0.0015, \qquad |V_{ub}| = 0.00367 \pm 0.00047.$$

The remaining three elements of the KM matrix involve the top quark. The
mean life of the top quark is so short that it is likely to decay before it has time to
settle into a top quark hadron. The methods described above are unavailable for
$|V_{ti}|$ (i = d, s or b).


18.4 CP symmetry violation in neutral kaon decays 

In Section 14.4 we obtained the important result that the quark sector of the Standard
Model is not invariant under the combined charge conjugation and parity (CP)
operation unless all the elements of the KM matrix can be made real. With the
parameterisation (18.2), this requires that the phase angle $\delta = 0$.

CP violation was first observed in 1964 in the decay of neutral K mesons. The
states of definite quark number are the $K^0(d\bar s)$ and $\bar K^0(\bar d s)$. These mesons are readily
produced in strong interactions, for example $\pi^-(\bar u d) + p(uud) \to K^0(d\bar s) + \Lambda(uds)$.
Without the weak interaction the $K^0$ and $\bar K^0$ would have equal mass and be stable.
The weak interaction is responsible for their instability, and CP violation would be
manifest if, for example, it were seen that the decay rates $K^0 \to \pi^+\pi^-$ and $\bar K^0 \to$
$\pi^+\pi^-$ were different. Such a difference can occur in second-order perturbation
theory in the weak interaction (first order in $G_F$; see (14.21)). This is known as
direct CP violation.

The weak interaction also gives rise to the phenomenon of mixing (Appendix E,
Fig. E1). Although mixing occurs only at second order in $G_F$ it has the dramatic
effect of splitting the mass degeneracy: it results in two mixed states of different
mass. If CP were conserved the mixed states would be

$$|K_1^0\rangle = (1/\sqrt{2})\,(|K^0\rangle + |\bar K^0\rangle) \quad\text{and}\quad |K_2^0\rangle = (1/\sqrt{2})\,(|K^0\rangle - |\bar K^0\rangle).$$

Acting on $K^0$ and $\bar K^0$, the CP operator may be taken to give

$$CP\,|K^0\rangle = |\bar K^0\rangle \quad\text{and}\quad CP\,|\bar K^0\rangle = |K^0\rangle.$$

Then $|K_1^0\rangle$ and $|K_2^0\rangle$ are eigenstates of CP with eigenvalues +1 and -1 respectively.
Experimentally two states with a mass difference $3.5 \times 10^{-12}$ MeV are indeed
observed; they also have very different mean lives

$$\tau_S = 8.9 \times 10^{-11}\ \text{s}, \qquad \tau_L = 5.17 \times 10^{-8}\ \text{s}.$$


The $K_S^0$ decays predominantly into two pions, $\pi^+\pi^-$ or $\pi^0\pi^0$. Each of these
two-pion states is an eigenstate of CP, with eigenvalue +1 (Problem 18.2). In its
mesonic decay modes, the $K_L^0$ decays predominantly into $\pi^0\pi^0\pi^0$, and these
three-pion states are eigenstates of CP with eigenvalue -1 (Problem 18.3). However, in
about three decays in a thousand $K_L^0$ decays into two pions, with CP eigenvalue +1.
If CP were conserved $K_L^0$ would be either $K_1^0$ or $K_2^0$ and could not have both two
pion and three pion decay modes. CP violation is also seen in leptonic K decays.
These show that direct CP violation is not responsible for the anomalous $K_L^0$ decays
but they are predominantly due to CP violation in mixing.

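It is shown in Appendix E that neither $|K_S^0\rangle$ nor $|K_L^0\rangle$ is an eigenstate of CP, but
each can be written in terms of $|K^0\rangle$ and $|\bar K^0\rangle$:

$$|K_S^0\rangle = N\,[\,p\,|K^0\rangle + q\,|\bar K^0\rangle\,], \qquad |K_L^0\rangle = N\,[\,p\,|K^0\rangle - q\,|\bar K^0\rangle\,]. \qquad (18.12)$$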


N is the normalisation factor: $(|p|^2 + |q|^2)^{-1/2}$. Note that q is not equal to p. In
Appendix E we indicate how p and q can be calculated in the Standard Model.

We can similarly express $|K_S^0\rangle$ and $|K_L^0\rangle$ in terms of $|K_1^0\rangle$ and $|K_2^0\rangle$:

$$|K_S^0\rangle = (N/\sqrt{2})\,[(p + q)\,|K_1^0\rangle + (p - q)\,|K_2^0\rangle],$$
$$|K_L^0\rangle = (N/\sqrt{2})\,[(p - q)\,|K_1^0\rangle + (p + q)\,|K_2^0\rangle]. \qquad (18.13)$$

Neglecting direct CP violation, only $K_1^0$ can decay into $\pi\pi$, so that the ratio of the
decay rates

$$\frac{\Gamma(K_L \to \pi\pi)}{\Gamma(K_S \to \pi\pi)} = \frac{|p/q - 1|^2}{|p/q + 1|^2} = (5.25 \pm 0.05) \times 10^{-6} \quad \text{(from experiment)}.$$

Defining $p/q = 1 + 2\varepsilon_K$ we infer that $|\varepsilon_K| = 2.3 \times 10^{-3}$; $\varepsilon_K$ is a measure of CP
violation.
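
The step from the measured ratio to the size of the CP-violating parameter is a one-line calculation. The sketch below takes the square root of the measured ratio of two-pion decay rates, which to leading order in the small parameter is its modulus; the second estimate, which treats the parameter as real and positive, is an illustrative assumption used only to show that the correction is negligible.

```python
import math

rate_ratio = 5.25e-6  # measured Gamma(K_L -> pi pi) / Gamma(K_S -> pi pi)

# With p/q = 1 + 2*eps, |p/q - 1| / |p/q + 1| = |eps| / |1 + eps|.
amplitude_ratio = math.sqrt(rate_ratio)

eps_leading = amplitude_ratio                            # leading order: |eps|
eps_if_real = amplitude_ratio / (1.0 - amplitude_ratio)  # exact if eps were real and positive

print(f"|eps_K| (leading order) = {eps_leading:.2e}")
print(f"|eps_K| (real eps)      = {eps_if_real:.2e}")
```

Both estimates round to $2.3 \times 10^{-3}$.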


Figure 18.2 The unitarity triangle. 

18.5 B meson decays and $B^0$, $\bar B^0$ mixing

At the B meson factories the $\Upsilon(4s)$ ($b\bar b$) meson is copiously produced by $e^+e^-$ collisions
with beam energies tuned to the meson mass. The meson decays almost exclusively
into $B^+$, $B^-$ or $B^0$, $\bar B^0$ pairs and so provides a rich source of B mesons. With a mass
of 5.28 GeV, B mesons decay into many different final states and many exhibit CP
violation. An indication of why this is so can be seen by a consideration of the
unitarity condition

$$V_{ud}V_{ub}^* + V_{cd}V_{cb}^* + V_{td}V_{tb}^* = 0,$$

which can be written as

$$z_1 + z_2 = 1 \qquad (18.14)$$

where we have defined

$$z_1 = -\frac{V_{ud}V_{ub}^*}{V_{cd}V_{cb}^*} \quad\text{and}\quad z_2 = -\frac{V_{td}V_{tb}^*}{V_{cd}V_{cb}^*}.$$

$z_1$ and $z_2$ are complex numbers that, in the complex plane, form a triangle, the
unitarity triangle illustrated in Fig. 18.2. Also it can be seen from the parameters
given in Section 18.1 that $V_{cd}V_{cb}^*$ is almost real and negative. Neglecting its very
small imaginary part, the angle $\gamma = \delta$, the phase of $V_{ub}^*$, and $\beta$ is the phase of
$V_{td}^*$. Of all the unitarity triangles, this is the only one with direct access to the
two KM matrix elements with large phases; it also involves the b quark and hence
B mesons.
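
The triangle can be built numerically from the central parameter values of Section 18.1. The sketch below assumes the definitions $z_1 = -V_{ud}V_{ub}^*/(V_{cd}V_{cb}^*)$ and $z_2 = -V_{td}V_{tb}^*/(V_{cd}V_{cb}^*)$, so that unitarity gives $z_1 + z_2 = 1$, and takes $\gamma$ as the phase of $z_1$ and $\beta$ as minus the phase of $z_2$; the variable names are illustrative.

```python
import cmath
import math

# Central values quoted in Section 18.1 (uncertainties ignored).
s12, s23, s13 = 0.2243, 0.0413, 0.0037
delta = math.radians(57.0)

c12, c23, c13 = (math.sqrt(1.0 - s * s) for s in (s12, s23, s13))
phase = cmath.exp(1j * delta)

# Only the matrix elements entering the d-b unitarity relation are needed.
V_ud = complex(c12 * c13)
V_ub = s13 / phase
V_cd = -s12 * c23 - c12 * s23 * s13 * phase
V_cb = complex(s23 * c13)
V_td = s12 * s23 - c12 * c23 * s13 * phase
V_tb = complex(c23 * c13)

denominator = V_cd * V_cb.conjugate()
z1 = -V_ud * V_ub.conjugate() / denominator
z2 = -V_td * V_tb.conjugate() / denominator

gamma_deg = math.degrees(cmath.phase(z1))
beta_deg = math.degrees(-cmath.phase(z2))
alpha_deg = 180.0 - beta_deg - gamma_deg

print(f"apex z1 = {z1.real:.3f} + {z1.imag:.3f}i, |z1 + z2 - 1| = {abs(z1 + z2 - 1):.1e}")
print(f"alpha = {alpha_deg:.1f}, beta = {beta_deg:.1f}, gamma = {gamma_deg:.1f} degrees")
```

With these inputs $\gamma$ differs from $\delta$ by only a few hundredths of a degree, which illustrates how small the imaginary part of $V_{cd}V_{cb}^*$ is.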

Of particular importance has been the measurement of the angle $\alpha$ through both
charged and neutral decays $B \to \pi\pi$, $B \to \pi\rho$ and $B \to \rho\rho$, and of the angle $\beta$
through $B^0$, $\bar B^0$ mixing. As one example it is shown in Appendix E how $\sin(2\beta)$ is
measured at the B factories.

Figure 18.3 The apex of the unitarity triangle is in, or near, the shaded region of
the plot. 


The unitarity triangle is specified by the position of its apex. This requires two
parameters, say the real and imaginary parts of $z_1$. A single parameter defines a
line on the complex plane and a parameter with errors defines a band. Four such
bands inferred from experiment are shown in Fig. 18.3. The most important point
illustrated by the figure is the consistency between four independent measurements.
There is no indication of the Standard Model failing. The KM phase $\delta$ ($\approx \gamma$) can be
seen to be in the region $\delta = 57^\circ \pm 14^\circ$. The apex of the unitarity triangle is in, or
near, the shaded region of the figure.


18.6 The CPT theorem 

We denote by T the operation of time reversal, t —> t' = — t. The CPT theorem 
states that, under very general conditions, a Lorentz invariant quantum field theory 
is invariant under the combined operations of charge conjugation, space inversion, 
and time reversal. The theorem was discovered by Pauli in 1955. 

For the Standard Model, the CPT theorem implies that, since CP is not a sym¬ 
metry of the Model, then neither is time reversal T. One may contemplate the 
implications for the ‘Arrow of Time’. 

Problems


18.1 Draw quark model diagrams for the decays

$$\pi^- \to \mu^- + \bar\nu_\mu, \qquad K^- \to \mu^- + \bar\nu_\mu.$$

Show that the decay amplitudes are proportional to $V_{ud}$ and $V_{us}$ respectively, and
$V_{us}/V_{ud} = \tan\theta_{12}$.

Neglecting the effects of the different quark masses, the ratio $\alpha_K/\alpha_\pi$ calculated
in Problem 9.10 would equal $V_{us}/V_{ud}$. Use this observation to estimate $\sin\theta_{12}$.

18.2 A $\pi^0$ meson is even under the charge conjugation operation C, i.e. $C|\pi^0\rangle = |\pi^0\rangle$.
Also, $C|\pi^+\rangle = |\pi^-\rangle$ and $C|\pi^-\rangle = |\pi^+\rangle$.

Show that two pions $|\pi^0, \pi^0\rangle$ or $|\pi^+, \pi^-\rangle$ in a relative S state and with their centre
of mass at rest satisfy $CP|\pi, \pi\rangle = |\pi, \pi\rangle$.

18.3 Show that a state of three $\pi^0$ mesons $|\pi^0, \pi^0, \pi^0\rangle$ with angular momentum zero and
centre of mass at rest satisfies $CP|\pi^0, \pi^0, \pi^0\rangle = -|\pi^0, \pi^0, \pi^0\rangle$. (See Problem 18.2.)

18.4 Show that the area of the unitarity triangle of Fig. 18.5 is J/2.

18.5 Show that if the quark fields are subject to a change of phase

$$d \to e^{i\theta_d}\,d, \qquad b \to e^{i\theta_b}\,b,$$

then the unitarity triangle of Fig. 18.5 is rotated through an angle $(\theta_d - \theta_b)$.

In this chapter we introduce the phenomenology of neutrino masses and mixing,
and show how the phenomenology can be made to be consistent with the SU(2)
$\times$ U(1) broken gauge symmetry of the Standard Model. We take it that neutrinos
and antineutrinos are distinct Dirac fermions, setting aside, until Chapter 21, the
suggestions that neutrinos are Majorana fermions.

The phenomenology arose from the observations that the number of electron 
neutrinos arriving at the Earth from the Sun is only about half of the number 
expected from our knowledge of the nuclear reactions that occur in the Sun, and the 
physics of the Sun’s interior. These observations are now explained as the result of 
some electron neutrinos turning into muon neutrinos and tau neutrinos during their 
transit between their creation in the interior of the Sun and their observation on 
Earth. These transitions violate the conservation laws of Section 9.3. We will show 
that they occur because the e, $\mu$ and $\tau$ neutrinos are not massless but, as conceived
by Pontecorvo (1968), they do not have a definite mass, i.e., they are not eigenstates
of the mass operator.


19.1 Neutrino masses 

The most general Lorentz invariant neutrino mass term that can be introduced into 
the Lagrangian density of the Standard Model is 

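$$\mathcal{L}_{\rm mass}(x) = -\sum_{\alpha,\beta}\nu_{\alpha L}^\dagger(x)\,m_{\alpha\beta}\,\nu_{\beta R}(x) + \text{Hermitian conjugate}, \qquad (19.1)$$

where $m_{\alpha\beta}$ is an arbitrary $3 \times 3$ complex matrix, $\alpha$ and $\beta$ run over the three neutrino
types e, $\mu$, $\tau$, and $\nu_{\alpha L}(x)$, $\nu_{\alpha R}(x)$ are left-handed and right-handed two-component
spinor fields. (Spinor indices are omitted here.)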

An arbitrary complex matrix can be put into real diagonal form with the help of
two unitary matrices (see Problem A.4). We can write

$$m_{\alpha\beta} = \sum_i U^{L*}_{\alpha i}\,m_i\,U^{R}_{\beta i} \qquad (19.2)$$

where $m_i$ are three real and positive masses; $U^L$ and $U^R$ are unitary matrices. It is
evident that $U^L_{\alpha i}$ and $U^R_{\alpha i}$ can be replaced by $U^L_{\alpha i}e^{i\delta_i}$ and $U^R_{\alpha i}e^{i\delta_i}$, where the $\delta_i$ are
three arbitrary phases.
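
The decomposition can be illustrated with a two-flavour toy example in plain Python. The sketch assumes the index convention $m_{\alpha\beta} = \sum_i U^{L*}_{\alpha i}\,m_i\,U^R_{\beta i}$; the mixing angles, phases and masses are hypothetical numbers chosen only for illustration. It builds a complex mass matrix from two unitary matrices and checks that $\sum_{\alpha\beta} U^L_{\alpha i}\,m_{\alpha\beta}\,U^{R*}_{\beta j}$ is real, positive and diagonal.

```python
import cmath
import math

def unitary2(theta, phi):
    """A 2x2 unitary matrix built from a rotation angle and one phase."""
    c, s = math.cos(theta), math.sin(theta)
    return [[complex(c), s * cmath.exp(1j * phi)],
            [-s * cmath.exp(-1j * phi), complex(c)]]

def mass_matrix(U_L, U_R, masses):
    """m[a][b] = sum_i conj(U_L[a][i]) * m_i * U_R[b][i] (assumed convention)."""
    n = len(masses)
    return [[sum(U_L[a][i].conjugate() * masses[i] * U_R[b][i] for i in range(n))
             for b in range(n)] for a in range(n)]

def rotate_to_mass_basis(m, U_L, U_R):
    """D[i][j] = sum_ab U_L[a][i] * m[a][b] * conj(U_R[b][j])."""
    n = len(m)
    return [[sum(U_L[a][i] * m[a][b] * U_R[b][j].conjugate()
                 for a in range(n) for b in range(n))
             for j in range(n)] for i in range(n)]

# Hypothetical inputs, for illustration only.
masses = [0.01, 0.05]
U_L = unitary2(0.6, 0.3)
U_R = unitary2(1.1, -0.7)

m = mass_matrix(U_L, U_R, masses)
D = rotate_to_mass_basis(m, U_L, U_R)

for row in D:
    print("  ".join(f"{abs(entry):.6f}" for entry in row))
```

Multiplying column i of both unitary matrices by the same phase leaves m unchanged, which is the freedom noted above.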

If we now define the fields 

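$$\nu_{iL}(x) = \sum_\alpha U^{L}_{\alpha i}\,\nu_{\alpha L}(x), \qquad \nu_{iR}(x) = \sum_\alpha U^{R}_{\alpha i}\,\nu_{\alpha R}(x), \qquad (19.3)$$

the mass term takes the standard Dirac form (5.12)

$$\mathcal{L}_{\rm mass}(x) = -\sum_i m_i\,(\nu_{iL}^\dagger\nu_{iR} + \nu_{iR}^\dagger\nu_{iL}). \qquad (19.4)$$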


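It is easy to show that the transformations given by equations (19.3) retain the Dirac
form of the dynamical terms:

$$\mathcal{L}_{\rm dyn} = \sum_\alpha i\,[\nu_{\alpha L}^\dagger\tilde\sigma^\mu\partial_\mu\nu_{\alpha L} + \nu_{\alpha R}^\dagger\sigma^\mu\partial_\mu\nu_{\alpha R}] = \sum_i i\,[\nu_{iL}^\dagger\tilde\sigma^\mu\partial_\mu\nu_{iL} + \nu_{iR}^\dagger\sigma^\mu\partial_\mu\nu_{iR}]. \qquad (19.5)$$

$(\mathcal{L}_{\rm dyn} + \mathcal{L}_{\rm mass})$ is the Lagrangian density of free neutrinos of masses $m_1$, $m_2$, $m_3$.
Since $U^L$ and $U^R$ are unitary matrices, and a unitary matrix U satisfies $UU^\dagger =$
$U^\dagger U = I$, we can invert equations (19.3) to give

$$\nu_{\alpha L}(x) = \sum_i U^{L*}_{\alpha i}\,\nu_{iL}(x), \qquad \nu_{\alpha R}(x) = \sum_i U^{R*}_{\alpha i}\,\nu_{iR}(x). \qquad (19.6)$$

The e, $\mu$ and $\tau$ neutrinos are mixtures of the neutrinos having definite mass. We
shall see that this leads to the phenomenon of neutrino oscillations.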


19.2 The weak currents 

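Neutrinos interact with each other and with other particles through the weak
currents. The charged weak current (9.2), expressed in terms of the neutrino mass
eigenfields using (19.6), becomes

$$j^\mu = \sum_\alpha \alpha_L^\dagger\,\tilde\sigma^\mu\,\nu_{\alpha L} = \sum_{\alpha,i}\alpha_L^\dagger\,\tilde\sigma^\mu\,U^{L*}_{\alpha i}\,\nu_{iL}. \qquad (19.7)$$

The $\alpha_L$ are the charged lepton fields, $\alpha = e, \mu, \tau$.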

The neutral weak current (9.17) keeps the same form: since $U^L$ is unitary, we
have

$$\sum_\alpha \nu_{\alpha L}^\dagger\,\tilde\sigma^\mu\,\nu_{\alpha L} = \sum_i \nu_{iL}^\dagger\,\tilde\sigma^\mu\,\nu_{iL}. \qquad (19.8)$$


As an example of how these modifications influence the physics discussed in 
earlier chapters, consider our effective pion interaction (9.1): 

$$\mathcal{L}_{\rm int} = \alpha_\pi\,[\,j^\mu\,\partial_\mu\Phi_\pi + j^{\mu\dagger}\,\partial_\mu\Phi_\pi^\dagger\,].$$


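The $\pi$ decay rate formula (9.3) for $\pi^- \to e^- + \bar\nu_e$ becomes three decay rates:

$$\frac{1}{\tau(\pi^- \to e^-\bar\nu_i)} = |U^L_{ei}|^2 \times [\text{the rate (9.3)}], \qquad i = 1, 2, 3.$$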


In the derivation of this result the effects of small neutrino masses have been
neglected. Because neutrino masses are small (see Table 1.2), it is not possible with
present technology to discern differences in energy between these decay modes. The
total decay rate is measured, and since $\sum_i U^L_{ei}U^{L*}_{ei} = 1$ we recover the expression
(9.3) for this. A similar conclusion can be drawn about the processes $\pi^- \to \mu^- + \bar\nu_\mu$
and $\tau^- \to \pi^- + \nu_\tau$, described in Section 9.2 by the same effective Lagrangian,
and about the results on muon decay of Section 9.4.


19.3 Neutrino oscillations 

The Lagrangian density (19.1) with (19.5) for a free neutrino yields the equations

$$i\tilde\sigma^\mu\partial_\mu\nu_{\alpha L} - m_{\alpha\beta}\,\nu_{\beta R} = 0, \qquad i\sigma^\mu\partial_\mu\nu_{\alpha R} - m^*_{\beta\alpha}\,\nu_{\beta L} = 0. \qquad (19.9)$$

These equations are a generalisation of the Dirac equations (5.11), and in this
section we shall interpret their solutions as neutrino wave functions for the three
types $\alpha = e, \mu, \tau$, not as neutrino fields. We shall look for energy eigenfunctions
with time dependence $e^{-iEt}$.

Zero mass neutrinos would have plane wave solutions of negative helicity (see
Section 6.6). For a wave in the $z$ direction

$$\nu_{\alpha L}(z,t) = e^{-iE(t-z)} f_\alpha\begin{pmatrix}0\\1\end{pmatrix}, \qquad \nu_{\alpha R} = 0,$$

where the $f_\alpha$ are constants.

The introduction of neutrino masses modifies these solutions by allowing the $f_\alpha$
to depend on $z$:

$$\nu_{\alpha L}(z,t) = e^{-iE(t-z)} f_\alpha(z)\begin{pmatrix}0\\1\end{pmatrix}, \qquad \nu_{\alpha R}(z,t) = e^{-iE(t-z)} g_\alpha(z)\begin{pmatrix}0\\1\end{pmatrix}.$$

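Substituting in the Dirac equations gives

$$i\frac{d}{dz}f_\alpha(z) - m_{\alpha\beta}\,g_\beta(z) = 0, \qquad (19.10)$$

$$\left(2E - i\frac{d}{dz}\right)g_\gamma(z) - m^*_{\alpha\gamma}\,f_\alpha(z) = 0. \qquad (19.11)$$

Note that $\sigma^3\begin{pmatrix}0\\1\end{pmatrix} = -\begin{pmatrix}0\\1\end{pmatrix}$.

For neutrino energies much greater than their mass we can neglect $-i\,dg_\gamma/dz$
compared with $2Eg_\gamma$ (see Problem 19.1) to obtain

$$g_\gamma(z) = m^*_{\alpha\gamma}\,f_\alpha(z)/2E, \qquad (19.12)$$

and hence by substitution three coupled equations for $f_\alpha(z)$:

$$i\frac{d}{dz}f_\beta(z) = m_{\beta\gamma}\,m^*_{\alpha\gamma}\,f_\alpha(z)/2E.$$

Diagonalising the mass matrices $m_{\beta\gamma}$ and $m^*_{\alpha\gamma}$ gives

$$i\frac{d}{dz}f_\beta(z) = U^{L*}_{\beta j}\,U^{L}_{\alpha j}\,f_\alpha(z)\,m_j^2/2E. \qquad (19.13)$$

The right-handed $U^R$ do not now appear, so that the label L is now redundant and
we shall put $U^L_{\alpha i} = U_{\alpha i}$ for the remainder of this section.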

To solve these equations we construct linear combinations

$$f_i(z) = U_{\alpha i}\,f_\alpha(z), \qquad i = 1, 2, 3, \qquad (19.14)$$

which satisfy, using (19.13),

$$i\frac{d}{dz}f_i(z) = iU_{\alpha i}\frac{d}{dz}f_\alpha(z) = U_{\alpha i}U^*_{\alpha j}U_{\beta j}\,m_j^2 f_\beta(z)/2E = \delta_{ij}U_{\beta j}\,m_j^2 f_\beta(z)/2E = (m_i^2/2E)\,f_i(z). \qquad (19.15)$$

These uncoupled equations have the simple solutions

$$f_i(z) = e^{-i(m_i^2/2E)z}\,f_i(0).$$

Inserting the factor $e^{-iE(t-z)}$, the $\nu_i$ neutrino wave function is

$$\nu_i(z,t) = e^{-i[Et - (E - m_i^2/2E)z]}\,f_i(0). \qquad (19.16)$$

This state has energy $E$ and momentum $p_i = E - m_i^2/2E$. For $m_i^2 \ll E^2$, $p_i^2 \approx
E^2 - m_i^2$, which is the relativistic relationship for a particle of mass $m_i$. Thus the
neutrino $\nu_i$ carries mass $m_i$. The $\nu_i(z,t)$ are the left-handed wavefunctions of (19.3).

Suppose that at $z = 0$ a neutrino of type $\alpha$ is born. The $\nu_\alpha$ wavefunction is a
linear superposition of mass eigenstates $\nu_i$ with $f_i(0) = U_{\alpha i}f_\alpha(0)$. Different mass
eigenstates propagate with different phases so that the neutrino type changes with
$z$:

$$f_\beta(z) = U^*_{\beta i}\,f_i(z) = U^*_{\beta i}\,e^{-i(m_i^2/2E)z}\,U_{\alpha i}\,f_\alpha(0). \qquad (19.17)$$

To be exact, a neutrino is born as a wave packet in some localised region of space
time around some point $z = 0$, $t = 0$. A realistic treatment of its propagation requires
the construction of the appropriate wave packet. We take it that the packet travels
with almost the speed of light and with little distortion so that, having travelled
a distance $z = D$, the probability amplitude for finding a neutrino of type $\beta$ will be
$e^{-iE(t-D)}f_\beta(D)$.

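The probability of a transition, $P_D(\nu_\alpha \to \nu_\beta)$, is

$$P_D(\nu_\alpha \to \nu_\beta) = \left|\sum_i U^*_{\beta i}\,e^{-i(m_i^2/2E)D}\,U_{\alpha i}\right|^2. \qquad (19.18)$$

$\mathrm{Re}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})$ is symmetric and $\mathrm{Im}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})$ antisymmetric under the
interchange of $i$ and $j$. From this and the unitarity of $U$ we can write

$$P_D(\nu_\alpha \to \nu_\beta) = \delta_{\alpha\beta} - 4\sum_{i>j}\mathrm{Re}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})\sin^2\left(\frac{\Delta m_{ij}^2 D}{4E}\right) + 2\sum_{i>j}\mathrm{Im}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})\sin\left(\frac{\Delta m_{ij}^2 D}{2E}\right), \qquad (19.19)$$

where $\Delta m_{ij}^2 = m_i^2 - m_j^2$.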

These expressions describe the phenomena of neutrino oscillations. We note
that experiments designed to observe and measure neutrino oscillations (Chapter
20) can only give values for the differences $\Delta m_{ij}^2$, and cannot give values for the
individual masses $m_i$. The differences must satisfy the condition

$$\Delta m_{12}^2 + \Delta m_{23}^2 + \Delta m_{31}^2 = 0.$$


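Restoring factors of $c$ and $\hbar$, it will be useful to write

$$\frac{\Delta m_{ij}^2 D}{4E} = \Delta m_{ij}^2 c^4\,\frac{D}{\hbar c}\,\frac{1}{4E} = 1.27\left(\frac{\Delta m_{ij}^2 c^4}{1\ \mathrm{eV}^2}\right)\left(\frac{D}{1\ \mathrm{km}}\right)\left(\frac{1\ \mathrm{GeV}}{E}\right). \qquad (19.20)$$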
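
As a quick numerical check of the units in (19.20), the following minimal sketch evaluates the phase for one set of inputs. The function name and the chosen inputs are illustrative assumptions, not part of the text.

```python
def oscillation_phase(dm2_ev2, distance_km, energy_gev):
    """Return Delta m^2 D / 4E (in radians) using the 1.27 factor of (19.20).

    dm2_ev2     : mass-squared difference in eV^2
    distance_km : source-detector distance D in km
    energy_gev  : neutrino energy E in GeV
    """
    return 1.27 * dm2_ev2 * distance_km / energy_gev


# Illustrative inputs only: dm2 = 7e-5 eV^2, D = 250 km, E = 1 GeV.
phase = oscillation_phase(7e-5, 250.0, 1.0)
print(f"phase = {phase:.4f} rad")
```

Because the phase scales as $D/E$, doubling the baseline at fixed energy has the same effect as halving the energy at fixed baseline.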


By considering the equations for the charge conjugate wave functions $\nu^c_\alpha$ (see
Section 7.4), similar formulae result, but with $U_{\alpha i}$ replaced by its complex conjugate
$U^*_{\alpha i}$. If $\mathrm{Im}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})$ is not zero it changes sign for antineutrinos and
$P_D(\bar\nu_\alpha \to \bar\nu_\beta) \neq P_D(\nu_\alpha \to \nu_\beta)$. The lepton sector joins the quark sector in displaying
matter-antimatter asymmetry.
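
The amplitude form (19.18) is easy to evaluate numerically. The sketch below is one possible implementation under stated assumptions: the mixing matrix is built from the standard three-angle, one-phase parametrisation, the function names are hypothetical, and the numerical inputs are illustrative rather than fitted values. Antineutrinos are handled by replacing $U$ with its complex conjugate, as described above.

```python
import cmath
import math


def mixing_matrix(th12, th13, th23, delta):
    """Unitary 3x3 matrix in the standard parametrisation (an assumption here)."""
    c12, s12 = math.cos(th12), math.sin(th12)
    c13, s13 = math.cos(th13), math.sin(th13)
    c23, s23 = math.cos(th23), math.sin(th23)
    ep = cmath.exp(1j * delta)
    return [
        [c12 * c13, s12 * c13, s13 / ep],
        [-s12 * c23 - c12 * s23 * s13 * ep, c12 * c23 - s12 * s23 * s13 * ep, s23 * c13],
        [s12 * s23 - c12 * c23 * s13 * ep, -c12 * s23 - s12 * c23 * s13 * ep, c23 * c13],
    ]


def probability(U, m2_ev2, alpha, beta, distance_km, energy_gev, antineutrino=False):
    """|sum_i U*_{beta i} exp(-i m_i^2 D / 2E) U_{alpha i}|^2, the form of (19.18).

    Flavour indices: 0 = e, 1 = mu, 2 = tau. m2_ev2 lists m_i^2 in eV^2;
    only differences matter, so the lightest entry may be set to zero.
    """
    amplitude = 0j
    for i in range(3):
        # m_i^2 D / 2E is twice the phase Delta m^2 D / 4E of (19.20).
        phase = 2.0 * 1.27 * m2_ev2[i] * distance_km / energy_gev
        u_beta, u_alpha = U[beta][i], U[alpha][i]
        if antineutrino:
            u_beta, u_alpha = u_beta.conjugate(), u_alpha.conjugate()
        amplitude += u_beta.conjugate() * cmath.exp(-1j * phase) * u_alpha
    return abs(amplitude) ** 2


# Illustrative parameters only.
U = mixing_matrix(0.58, 0.15, math.pi / 4, 0.8)
m2 = [0.0, 7e-5, 2.07e-3]
p_nu = probability(U, m2, 1, 0, 295.0, 0.6)
p_anti = probability(U, m2, 1, 0, 295.0, 0.6, antineutrino=True)
print(f"P(nu_mu -> nu_e)       = {p_nu:.4f}")
print(f"P(nubar_mu -> nubar_e) = {p_anti:.4f}")
```

With a non-zero phase the two printed probabilities differ, while the survival probabilities ($\alpha = \beta$) are the same for neutrinos and antineutrinos because the imaginary term vanishes there.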


19.4 The MSW effect 

In many experiments that investigate oscillations the neutrinos are not completely
free, but pass through matter on their journey from source to detector. This modifies
the free wave functions discussed in the previous sections. In particular, matter
contains electrons that interact with neutrinos through the charged weak currents.
The effective interaction Lagrangian for this process is given by (9.8):

$$\mathcal{L}_{\text{int}} = -2\sqrt{2}\,G_F\,g_{\mu\nu}\,j^{\mu\dagger}j^\nu,$$

where, from (9.2), $j^{\mu\dagger} = \nu_{eL}^\dagger\tilde\sigma^\mu e_L$, giving

$$\mathcal{L}_{\text{int}} = -2\sqrt{2}\,G_F\,g_{\mu\nu}\,(e_L^\dagger\tilde\sigma^\mu\nu_{eL})(\nu_{eL}^\dagger\tilde\sigma^\nu e_L) = -2\sqrt{2}\,G_F\,g_{\mu\nu}\,(e_L^\dagger\tilde\sigma^\mu e_L)(\nu_{eL}^\dagger\tilde\sigma^\nu\nu_{eL}). \qquad (19.21)$$

The last step uses a Fierz transformation (Appendix A).

For matter at rest, the expectation value of $e_L^\dagger\tilde\sigma^0 e_L = e_L^\dagger e_L = \tfrac{1}{2}N_e(x)$, where
$N_e(x)$ is the total electron density at $x$. The factor of 1/2 stems from the involvement
of the left-handed electron field components only. Also, apart possibly from
ferromagnetic effects, we can expect that the expectation value of $e_L^\dagger\tilde\sigma^i e_L = 0$. The
neutrino Lagrangian density acquires an additional term $-\sqrt{2}\,G_F N_e(x)\,\nu_{eL}^\dagger\nu_{eL}$. This
results in the modified equations for $f(z)$:

$$i\frac{d}{dz}f_\beta(z) - m_{\beta\gamma}\,m^*_{\alpha\gamma}\,f_\alpha(z)/2E - V(z)\,\delta_{\beta e}\,f_e(z) = 0,$$

or equivalently (see equation 19.15)

$$i\frac{df_i(z)}{dz} = \frac{m_i^2}{2E}\,f_i(z) + V(z)\,U_{ei}U^*_{ej}\,f_j(z), \qquad (19.22)$$

where $V(z) = \sqrt{2}\,N_e(z)\,G_F$.

The influence of matter on the propagation of neutrinos was pointed out by
Wolfenstein (1978), and further elaborated by Mikheyev and Smirnov (1986). It is
known as the MSW effect.

The neutral weak currents also contribute to the Lagrangian density of all neutrino 
types and result in an additional common phase factor on the wave functions of all 
types, which has no influence on neutrino oscillations. 
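
To get a feel for the size of $V = \sqrt{2}\,N_e G_F$, the sketch below converts an electron number density in cm$^{-3}$ to a potential in eV. The numerical constants are standard reference values quoted to limited precision, and the two densities (roughly 100 and 1.5 times Avogadro's number per cm$^3$) are assumed round figures for a dense stellar core and for crustal rock; none of these inputs come from the text.

```python
import math

G_F_EV = 1.1664e-5 * 1e-18   # Fermi constant, 1.1664e-5 GeV^-2 expressed in eV^-2
HBAR_C_EV_CM = 1.97327e-5    # hbar * c in eV cm
AVOGADRO = 6.022e23          # particles per mole


def matter_potential_ev(n_e_per_cm3):
    """V = sqrt(2) G_F N_e in eV, for an electron density given in cm^-3."""
    # Convert the density to natural units (eV^3) with (hbar c)^3.
    n_e_natural = n_e_per_cm3 * HBAR_C_EV_CM ** 3
    return math.sqrt(2.0) * G_F_EV * n_e_natural


# Assumed, order-of-magnitude densities.
print(f"dense core : {matter_potential_ev(100 * AVOGADRO):.2e} eV")
print(f"crust rock : {matter_potential_ev(1.5 * AVOGADRO):.2e} eV")
```

Comparing such values with $\Delta m^2/2E$ for the neutrino energy of interest indicates whether the matter term in (19.22) can be neglected.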


19.5 Neutrino masses and the Standard Model 


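In the Weinberg-Salam electroweak theory for leptons of Chapter 12 we introduced
three left-handed lepton doublet fields:

$$L_e = \begin{pmatrix}\nu_{eL}\\ e_L\end{pmatrix}, \qquad L_\mu = \begin{pmatrix}\nu_{\mu L}\\ \mu_L\end{pmatrix}, \qquad L_\tau = \begin{pmatrix}\nu_{\tau L}\\ \tau_L\end{pmatrix},$$

and three right-handed singlets $e_R$, $\mu_R$, $\tau_R$. Under an SU(2) transformation,

$$L_\alpha \to L'_\alpha = UL_\alpha, \qquad \alpha_R \to \alpha'_R = \alpha_R.$$

Dirac neutrinos having mass implies the existence of right-handed neutrino fields.
In the Standard Model the right-handed neutrino fields, like the right-handed fields
of the charged leptons, must be SU(2) singlets. Neutrino masses are introduced into
the model in the same way as the u, c and t quarks by coupling to the Higgs field.
An SU(2) invariant coupling of the Higgs field to neutrinos is then (equation (14.9)
and Problem 14.3)

$$\mathcal{L}(x) = -\sum_{\alpha\beta}\left[G^\nu_{\alpha\beta}\,(L_\alpha^\dagger\varepsilon\Phi^*)\,\nu_{\beta R} + \text{Hermitian conjugate}\right], \qquad (19.23)$$

where $G^\nu_{\alpha\beta}$ is a complex 3 × 3 matrix. On symmetry breaking this gives the neutrino
mass term

$$\mathcal{L}_{\text{mass}} = -\phi_0\sum_{\alpha,\beta}\left[G^\nu_{\alpha\beta}\,\nu_{\alpha L}^\dagger\nu_{\beta R} + G^{\nu*}_{\alpha\beta}\,\nu_{\beta R}^\dagger\nu_{\alpha L}\right]. \qquad (19.24)$$

This is just the mass term of equation (19.1) if we identify $\phi_0 G^\nu_{\alpha\beta}$ with $m_{\alpha\beta}$.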


19.6 Parameterisation of U 

We have taken the parameters $m_e$, $m_\mu$, $m_\tau$ and $m_i$ to be real and positive, but this
is in fact a phase convention: any phase on these parameters can be absorbed in
phase factors multiplying the lepton fields, and such phase factors are of no physical
significance. It is also the case that the definition of the mass matrix $m_{\alpha\beta}$ depends
on a phase convention.

Define the six neutrino fields $\nu'_{\alpha L}$, $\nu'_{\alpha R}$ ($\alpha = e, \mu, \tau$) and the six charged lepton
fields $\alpha'_L$, $\alpha'_R$ by

$$\nu_{\alpha L} = e^{i\theta_\alpha}\nu'_{\alpha L}, \qquad \nu_{\alpha R} = e^{i\gamma_\alpha}\nu'_{\alpha R}, \qquad \alpha_L = e^{i\theta_\alpha}\alpha'_L, \qquad \alpha_R = e^{i\theta_\alpha}\alpha'_R.$$

The leptonic part of the electroweak Lagrangian density described in Chapter 12
(equation (12.12)), and the charged current (equation (12.16)) and neutral current
(equation (12.23)) that give the neutrino coupling to the $W^\pm$ and $Z$ fields, are
unchanged in form under these transformations. The neutrino mass matrix retains
the same form but with $m_{\alpha\beta}$ replaced by

$$m'_{\alpha\beta} = e^{-i\theta_\alpha + i\gamma_\beta}\,m_{\alpha\beta}.$$

We can redefine $m_{\alpha\beta}$ in this way, keeping the physical content of the theory
unchanged.

The unitary matrix $U^L$ was defined by $m_{\alpha\beta} = \sum_i U^{L*}_{\alpha i}\,m_i\,U^R_{\beta i}$. Hence we can
redefine $U'_{\alpha i} = e^{i(\theta_\alpha - \delta_i)}U_{\alpha i}$, where the phase factors $e^{i\delta_i}$ were introduced in Section
19.1. As in our discussion of the KM matrix in Section 14.2, when the non-physical
phase factors have been taken out, the resulting matrix depends on four physical
parameters. We parameterise it in the same way as the KM matrix but replace $\theta_{1j}$
by $\theta_{ej}$, $\theta_{2j}$ by $\theta_{\mu j}$ and $\theta_{3j}$ by $\theta_{\tau j}$, etc. It can be called the neutrino mass mixing
matrix.

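The term exhibiting matter-antimatter asymmetry in $P_D(\nu_\alpha \to \nu_\beta)$ is (see
Problem 19.2)

$$2\sum_{i>j}\mathrm{Im}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})\sin\left(\frac{\Delta m_{ij}^2 D}{2E}\right) = \begin{cases} 0 & \text{if } \alpha = \beta,\\[4pt] \pm 8J\,\sin\left(\dfrac{\Delta m_{21}^2 D}{4E}\right)\sin\left(\dfrac{\Delta m_{32}^2 D}{4E}\right)\sin\left(\dfrac{\Delta m_{31}^2 D}{4E}\right) & \text{otherwise,}\end{cases}$$

where $J = c_{e2}\,c_{e3}^2\,c_{\mu 3}\,s_{e2}\,s_{e3}\,s_{\mu 3}\sin\delta$, cf. (14.18, 14.19), the minus sign is taken for
transitions $e \to \mu$, $\mu \to \tau$, $\tau \to e$, and the plus sign otherwise.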
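
The closed form for $J$ can be checked numerically against the imaginary part of the product of four matrix elements defined in Problem 19.2. The sketch below assumes the standard three-angle, one-phase parametrisation for the mixing matrix; the function names and the chosen angles are illustrative assumptions only.

```python
import cmath
import math


def mixing_matrix(th12, th13, th23, delta):
    """Unitary 3x3 matrix in the standard parametrisation (an assumption here)."""
    c12, s12 = math.cos(th12), math.sin(th12)
    c13, s13 = math.cos(th13), math.sin(th13)
    c23, s23 = math.cos(th23), math.sin(th23)
    ep = cmath.exp(1j * delta)
    return [
        [c12 * c13, s12 * c13, s13 / ep],
        [-s12 * c23 - c12 * s23 * s13 * ep, c12 * c23 - s12 * s23 * s13 * ep, s23 * c13],
        [s12 * s23 - c12 * c23 * s13 * ep, -c12 * s23 - s12 * c23 * s13 * ep, c23 * c13],
    ]


def f_term(U, beta, alpha, i, j):
    """Im(U*_{beta i} U_{alpha i} U_{beta j} U*_{alpha j}), as in Problem 19.2."""
    product = U[beta][i].conjugate() * U[alpha][i] * U[beta][j] * U[alpha][j].conjugate()
    return product.imag


def j_closed_form(th12, th13, th23, delta):
    """c12 c13^2 c23 s12 s13 s23 sin(delta)."""
    return (math.cos(th12) * math.cos(th13) ** 2 * math.cos(th23)
            * math.sin(th12) * math.sin(th13) * math.sin(th23) * math.sin(delta))


angles = (0.58, 0.15, math.pi / 4, math.pi / 4)  # illustrative values only
U = mixing_matrix(*angles)
E, MU = 0, 1  # flavour indices
print("F(mu, e, 1, 2) =", f_term(U, MU, E, 0, 1))
print("closed form J  =", j_closed_form(*angles))
```

The same `f_term` function can be used to confirm the cyclic equalities $F_{\beta\alpha 12} = F_{\beta\alpha 23} = F_{\beta\alpha 31}$ asked for in Problem 19.2.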


19.7 Lepton number conservation 

Having defined the phase conventions that fix the parameters of the neutrino mixing
matrix, the Lagrangian density has only one remaining global U(1) symmetry. It is
unchanged if all lepton fields, charged and neutral, left-handed and right-handed,
are multiplied by the same phase factor $e^{i\delta}$. Following the method of Section 7.1, we
consider an arbitrary small space- and time-dependent variation in $\delta$, and conclude
that we have one conserved current:

$$J^\mu(x) = \sum_\alpha\left[\alpha_L^\dagger(x)\tilde\sigma^\mu\alpha_L(x) + \alpha_R^\dagger(x)\sigma^\mu\alpha_R(x) + \nu_{\alpha L}^\dagger(x)\tilde\sigma^\mu\nu_{\alpha L}(x) + \nu_{\alpha R}^\dagger(x)\sigma^\mu\nu_{\alpha R}(x)\right]. \qquad (19.25)$$

The quantity $\int J^0(x)\,d^3x$ counts the number of leptons minus the number of
antileptons, and this number is conserved.


19.8 Sterile neutrinos 

We will see in the next chapter that there is some experimental indication that there
are more than three neutrino mass eigenstates. If these indications are confirmed
then we will be obliged to introduce a fourth neutrino type (perhaps more), say
$\nu_w$. Since there is no indication of another charged lepton to partner $\nu_{wL}$ in an
SU(2) doublet, and since the decays of the Z (Section 13.6) confirm that only three
neutrino types participate in the weak interaction, both $\nu_{wL}$ and $\nu_{wR}$ must be SU(2)
singlets and have no electroweak interactions except through the mass eigenstate.
Such a neutrino is known as a sterile neutrino.


Problems 

19.1 Neglect the term $i(dg_\gamma/dz)$ in (19.11) and show that $g_\gamma(z) = m^*_{\alpha\gamma}f_\alpha/2E$. Show
that an estimate of $i(dg_\gamma/dz)$ is then $i\,dg_\gamma(z)/dz = S_{\gamma\beta}\,(2Eg_\beta(z))$ with $S_{\gamma\beta} =
m^*_{\alpha\gamma}m_{\alpha\beta}/4E^2$, very small for $E$ much greater than the masses.

19.2 Define $F_{\beta\alpha ij} = \mathrm{Im}(U^*_{\beta i}U_{\alpha i}U_{\beta j}U^*_{\alpha j})$.

(a) Show that $F_{\beta\alpha ij} = -F_{\beta\alpha ji}$ and that $\sum_j F_{\beta\alpha ij} = 0$, and hence that
$F_{\beta\alpha 12} = F_{\beta\alpha 23} = F_{\beta\alpha 31}$. Define $J = F_{\mu e12}$ (this conforms with (14.18) and
(19.25)).

(b) Using the trigonometric identity $\sin(x) + \sin(y) - \sin(x + y) =
4\sin(x/2)\sin(y/2)\sin((x + y)/2)$, verify the matter-antimatter asymmetry term in
(19.25) for $P_D(\nu_\alpha \to \nu_\beta)$.


Neutrino masses and mixing: experimental results


The cross-sections for neutrino-lepton and neutrino-quark interactions are exceedingly
small: the collection of data from a particular experiment may extend over
several years. The aims of neutrino experiments include: establishing the existence
of neutrino oscillations, checking the validity of the theory of Chapter 19, measuring
the parameters of the mixing matrix $U$ and determining the mass eigenstates of
the neutrino. In this chapter we shall present some results of recent experimental
work, and indicate how they have been obtained.


20.1 Introduction 

Setting aside the possible existence of sterile neutrinos, it is thought that there are
three neutrino mass eigenstates, which we shall label by $i = 1, 2, 3$. Measurements
of neutrino oscillations give (mass)$^2$ differences:

$$\Delta m_{ij}^2 = m_i^2 - m_j^2.$$

It is estimated from experiment that

$$1.3\times10^{-3}\ \mathrm{eV}^2 < |\Delta m_{32}^2| < 3\times10^{-3}\ \mathrm{eV}^2,$$

and

$$6.5\times10^{-5}\ \mathrm{eV}^2 < |\Delta m_{21}^2| < 8.5\times10^{-5}\ \mathrm{eV}^2.$$

Then $\Delta m_{31}^2 = \Delta m_{32}^2 + \Delta m_{21}^2$.

For illustrative numerical calculations in this chapter we shall take $|\Delta m_{32}^2| =
2\times10^{-3}\ \mathrm{eV}^2$ and $|\Delta m_{21}^2| = 7\times10^{-5}\ \mathrm{eV}^2$.

Figure 20.1 A three neutrino mass-squared spectrum (vertical axis: (Mass)$^2$). The $\nu_e$ fraction of each mass
eigenstate is indicated by right-leaning hatching, the $\nu_\mu$ fraction is blank and the
$\nu_\tau$ fraction by left-leaning hatching (see the report by B. Kayser, Particle Data
Group, 2004). The mass-squared base line is not known.


The 3 × 3 unitary mixing matrix is approximately

$$U = \begin{pmatrix} c & s & s_{e3}e^{-i\delta}\\ -s/\sqrt{2} & c/\sqrt{2} & 1/\sqrt{2}\\ s/\sqrt{2} & -c/\sqrt{2} & 1/\sqrt{2}\end{pmatrix}, \qquad (20.1)$$

where $c \approx \cos\theta_{e2} \approx 0.84$ and $s \approx \sin\theta_{e2} \approx 0.54$.

It is estimated that $s_{e3}^2 < 0.05$. A term $s_{e3}e^{-i\delta}$ with $\sin\delta \neq 0$ would violate
CP conservation and lead to matter-antimatter asymmetry. Such asymmetry has
not yet (2006) been discerned. If $s_{e3} \neq 0$ there are small complex corrections to
other elements of the matrix. (The matrix (20.1) may be obtained from the unitary
KM matrix of Section 14.3 by taking $c_{13}\,(= c_{e3}) = 1$, $c_{12}\,(= c_{e2}) = c$, $c_{23}\,(= c_{\mu 3}) =
1/\sqrt{2}$, $s_{13} = s_{e3}$.)

The (mass)$^2$ differences imply either a spectrum of (mass)$^2$ eigenstates as in
Fig. 20.1, with the closest eigenstates having the smallest mass, or the figure might
be inverted, with the closest (mass)$^2$ eigenstates the heaviest. The mixing matrix
determines the fractions of $\nu_e$, $\nu_\mu$ and $\nu_\tau$ states making up the states 1, 2, 3, and
these are indicated on the figure.

In many data analyses the approximation is made of setting $s_{e3} = 0$. We shall
see that any particular analysis is then greatly simplified since the number of
participating neutrino mass eigenstates is reduced from three to two. Apart from our
discussion of the CHOOZ experiment, we shall always make this approximation.
However, as the quality of data improves, and in particular when and if $s_{e3}$ is seen
to be non-zero, the approximation will be abandoned. It is important to note that with
$s_{e3} = 0$ there is no CP violation.

The analysis of data from accelerator and reactor neutrinos is the least complicated,
since the MSW effect is negligible at the levels of precision so far obtained,
and our formula (19.19) can be directly invoked.


20.2 K2K 


The Japanese K2K experiment studies a muon neutrino beam that is engineered at
the KEK proton accelerator. 12 GeV protons hit an aluminium target, producing
mainly positive pions that decay $\pi^+ \to \mu^+ + \nu_\mu$ (Section 9.2). The beam characteristics
are measured by near detectors located 300 m downstream from the
proton target. The mean energy is 1.3 GeV. There is then a 250 km flight path
to the Super-Kamiokande detector in the Kamioka mine. This detector consists of
22.5 kilotonnes of very pure water (H$_2$O). Muon neutrinos are observed through
their reaction with neutrons in the oxygen nuclei: $\nu_\mu + n \to p + \mu^-$. The neutrino
energy $E_\nu$ can be determined from measurements of the energy and direction of
the muon.

To reach the detector, a neutrino has to pass through the Earth's upper crust. However,
we ignore any MSW effect for the moment, and take the values of $\Delta m_{ij}^2$ given
in Section 20.1: $\Delta m_{21}^2 = 7\times10^{-5}\ \mathrm{eV}^2$ and $D = 250$ km. From (19.20) the oscillating
function

$$\sin^2\left(\frac{\Delta m_{21}^2 D}{4E_\nu}\right) = \sin^2\left(0.022\,\frac{1\ \mathrm{GeV}}{E_\nu}\right) < 10^{-3}$$

for all relevant $E_\nu$. This is so small that with present precision it can be ignored. Also, since
$\Delta m_{31}^2 = \Delta m_{32}^2 + \Delta m_{21}^2$, the two other oscillating functions are almost equal, and we will take
them both as $\sin^2(\Delta m_{\text{At}}^2 D/4E_\nu)$, with $\Delta m_{\text{At}}^2$ a mean value of $\Delta m_{32}^2$ and $\Delta m_{31}^2$. For
historical reasons $\Delta m_{\text{At}}^2$ is called the atmospheric mass squared difference.

With these approximations, setting $U_{e3} = 0$ and using the unitarity of $U$, equations
(19.19) give

$$P_D(\nu_\mu \to \nu_\mu) = 1 - 4|U_{\mu 3}|^2\left(1 - |U_{\mu 3}|^2\right)\sin^2\left(\frac{\Delta m_{\text{At}}^2 D}{4E_\nu}\right),$$

$$P_D(\nu_\mu \to \nu_e) = 0, \qquad (20.2)$$

$$P_D(\nu_\mu \to \nu_\tau) = 4|U_{\mu 3}|^2\left(1 - |U_{\mu 3}|^2\right)\sin^2\left(\frac{\Delta m_{\text{At}}^2 D}{4E_\nu}\right).$$

From these equations, and because of the smallness of $|U_{e3}|^2$, the $\Delta m_{\text{At}}^2$ oscillation
is almost entirely between $\nu_\mu$ and $\nu_\tau$. Since the MSW effect is for electron
neutrinos only, it can with present precision be neglected. With $U_{e3} = 0$, we have
$|U_{\mu 3}| = \sin\theta_{\mu 3}$ and we arrive at our final formula:

$$P_D(\nu_\mu \to \nu_\mu) = 1 - \sin^2(2\theta_{\mu 3})\,\sin^2\left(\frac{\Delta m_{\text{At}}^2 D}{4E_\nu}\right) \qquad (20.3)$$

for fitting the K2K data. This is presented in Fig. 20.2, in which the numbers of
events in the designated energy bins are shown as a function of the mean neutrino
energy of each bin. The dashed curve is the expected number distribution
$dN/dE_\nu$ without oscillation, and when integrated over $E_\nu$ is clearly larger than
the total number (29) of events accepted. The best fit with equation (20.3), modified
to take account of corrections such as energy resolution, is also shown.
It corresponds to $\Delta m_{\text{At}}^2 = 2.8\times10^{-3}\ \mathrm{eV}^2$ and $\sin^2(2\theta_{\mu 3}) = 1$. The latter allows
$\theta_{\mu 3} = \pi/4$, $\cos\theta_{\mu 3} = \sin\theta_{\mu 3} = 1/\sqrt{2}$.

Figure 20.2 K2K data (M. H. Ahn et al. Phys. Rev. Lett. 90, 041801 (2003)).
Points with error bars are data. The box histogram is the expected spectrum without
oscillations, where the height of the box is the systematic error. The solid
line is the best-fit spectrum. These histograms are normalised by the number of
events observed (29). In addition, the dashed line shows the expectation with no
oscillations normalised to the expected number of events (44).
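
The two-flavour survival formula (20.3) can be tabulated directly. The sketch below is illustrative only: the function name is hypothetical, it uses the numerical factor of (19.20), and it ignores the energy-resolution corrections mentioned above.

```python
import math


def survival_probability(sin2_2theta, dm2_ev2, distance_km, energy_gev):
    """1 - sin^2(2 theta) sin^2(1.27 dm2 D / E), the form of (20.3)."""
    phase = 1.27 * dm2_ev2 * distance_km / energy_gev
    return 1.0 - sin2_2theta * math.sin(phase) ** 2


# Best-fit values quoted for K2K: dm2 = 2.8e-3 eV^2, sin^2(2 theta) = 1, D = 250 km.
for energy in (0.5, 1.0, 1.3, 2.0, 3.0):
    p = survival_probability(1.0, 2.8e-3, 250.0, energy)
    print(f"E = {energy:.1f} GeV   P(nu_mu -> nu_mu) = {p:.2f}")
```

The deficit is strongest at the lower energies of the beam, which is why the shape of the measured spectrum, and not only the total event count, constrains $\Delta m_{\text{At}}^2$.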


20.3 Chooz 

Chooz is a village close to a French nuclear power station. The power station's
two reactors are rich sources of electron antineutrinos $\bar\nu_e$. The fluxes and energy
distributions, centred around 3 MeV, of these antineutrinos are very well understood.
The detector, shielded from cosmic ray muons by its location deep underground,
was positioned about 1 km from the reactors.

The antineutrinos $\bar\nu_e$ were detected by their inverse $\beta$ decay interaction with
protons, $\bar\nu_e + p + 1.8\ \mathrm{MeV} \to n + e^+$, in a hydrogen rich paraffinic liquid scintillator.

As with the K2K experiment, the oscillatory function $\sin^2(\Delta m_{21}^2 D/4E_\nu)$ is,
from (19.20), negligibly small, $< 2\times10^{-3}$ (taking $D = 1$ km, $E_\nu > 1.8$ MeV).
The MSW effect can also be neglected, since for material in the Earth's crust $V(z) \sim
10^{-13}\ \mathrm{eV} \ll \Delta m_{21}^2/2E_\nu < \Delta m_{32}^2/2E_\nu$. We can, again, to a good approximation,
put $\Delta m_{32}^2 D/4E_\nu = \Delta m_{31}^2 D/4E_\nu = \Delta m_{\text{At}}^2 D/4E_\nu$ to obtain

$$P_D(\bar\nu_e \to \bar\nu_e) = 1 - 4|U_{e3}|^2\left(1 - |U_{e3}|^2\right)\sin^2\left(\Delta m_{\text{At}}^2 D/4E_\nu\right). \qquad (20.4)$$

Setting $|U_{e3}| = \sin\theta_{e3}$, $D = 1$ km, $\Delta m_{\text{At}}^2 = 2\times10^{-3}\ \mathrm{eV}^2$, we find from (19.20)

$$P_D(\bar\nu_e \to \bar\nu_e) = 1 - \sin^2(2\theta_{e3})\,\sin^2\left[2.54\,(1\ \mathrm{MeV}/E_\nu)\right].$$

To the experimental precision obtained, there was no reduction in flux at the
detector and no oscillation, and it was concluded (Apollonio et al., 2003) that
$\sin^2(2\theta_{e3}) < 0.18$, which implies $|U_{e3}|^2 = \sin^2\theta_{e3} < 0.05$, the result we quote in Section
20.1 of this chapter.
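
The step from the bound on $\sin^2(2\theta_{e3})$ to the bound on $\sin^2\theta_{e3}$ uses $\sin^2(2\theta) = 4\sin^2\theta\,(1 - \sin^2\theta)$ and takes the small-angle root. A minimal sketch of that arithmetic (the function name is an illustrative assumption):

```python
import math


def sin2_theta_from_sin2_2theta(sin2_2theta):
    """Smaller root s^2 of 4 s^2 (1 - s^2) = sin^2(2 theta)."""
    return 0.5 * (1.0 - math.sqrt(1.0 - sin2_2theta))


bound = sin2_theta_from_sin2_2theta(0.18)
print(f"sin^2(theta_e3) < {bound:.3f}")
```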


20.4 KamLAND 

Like Chooz, the Kamioka Liquid scintillator AntiNeutrino Detector (KamLAND)
experiment uses reactor antineutrinos. The sources are a group of nuclear power
stations in Japan situated at various distances, roughly 100 km to 200 km, from the detector.
As at Chooz, the detector makes use of the inverse $\beta$ decay $\bar\nu_e + p \to n + e^+$.

The experiment was designed to explore the $\Delta m_{21}^2 \approx 7\times10^{-5}\ \mathrm{eV}^2$ mass region.
For a particular reactor at distance $D$ from the detector, we have from (19.19),
setting $|U_{e3}|^2 = 0$, that the survival probability is given by

$$P_D(\bar\nu_e \to \bar\nu_e) = 1 - 4|U_{e1}|^2|U_{e2}|^2\sin^2\left(\frac{\Delta m_{21}^2 D}{4E_\nu}\right), \qquad (20.5)$$

and from the parameterization (14.16)

$$4|U_{e1}|^2|U_{e2}|^2 = \cos^4\theta_{e3}\,\sin^2 2\theta_{e2} \approx \sin^2 2\theta_{e2}.$$

Figure 20.3 KamLAND data (K. Eguchi et al. Phys. Rev. Lett. 90, 021802 (2003)).
The energy distribution of the observed positrons in bins of 0.425 MeV (solid
circles with error bars), along with the expected no oscillations distribution (upper
histogram) and the best fit including oscillations using (20.5) (lower histogram).
The shaded bands indicate the systematic error in the best fit distribution. The
vertical dashed line corresponds to the analysis threshold at 2.6 MeV.

As at Chooz, MSW effects are negligible. The measured positron energy spectrum 
is compared with the positron energy spectrum that would be expected if there 
were no antineutrino oscillations. This spectrum can be very well estimated from 
knowledge of the various reactor characteristics. 

Some results from KamLAND are shown in Fig. 20.3. The energy spectrum of
the positrons is clearly below what it would be without oscillation. The best fit to
the data using an expression based on (20.5) has

$$|\Delta m_{21}^2| = 6.9\times10^{-5}\ \mathrm{eV}^2, \qquad 0.84 < \sin^2 2\theta_{e2} < 1.$$

The KamLAND analysis took some account of systematic errors arising from the
simplifying assumption $|U_{e3}|^2 = 0$.


20.5 Atmospheric neutrinos

The Earth is continually bombarded by cosmic rays, which consist for the most part
of high energy protons and electrons. The protons, in their collisions with nuclei
in the upper atmosphere, produce $\pi$ mesons. The $\pi$ mesons decay by the chains
(Section 9.2, Section 9.4):

$$\pi^+ \to \mu^+ + \nu_\mu, \qquad \mu^+ \to e^+ + \nu_e + \bar\nu_\mu;$$

$$\pi^- \to \mu^- + \bar\nu_\mu, \qquad \mu^- \to e^- + \bar\nu_e + \nu_\mu.$$

The neutrinos and antineutrinos are produced at a mean height of about 20 km, with
energies extending to the multi-GeV region. The ratio of the flux of $\nu_\mu + \bar\nu_\mu$ to the
flux of $\nu_e + \bar\nu_e$ is evidently about 2.

In water detectors, such as Super-Kamiokande, charged leptons are produced
through reactions essentially of the form

$$\nu_e + n \to e^- + p, \qquad \bar\nu_e + p \to e^+ + n;$$

$$\nu_\mu + n \to \mu^- + p, \qquad \bar\nu_\mu + p \to \mu^+ + n.$$

The charged leptons emit Cerenkov radiation, which provides information on the
energy, direction and identity of the incident neutrino.

Figure 20.4 shows some results from the Super-Kamiokande detector. The plots
show the ratio of observed $\nu_e$- and $\nu_\mu$-like events to Monte Carlo calculations
in the absence of oscillations, as a function of $D/E_\nu$. $E_\nu$ is the neutrino energy
and $D$ the distance from the point of production, about 20 km above the Earth's surface,
to the detector. $D$ is then inferred from the measured neutrino direction.
For multi-GeV electron neutrinos, the MSW modification to the equations has
to be included for those neutrinos passing through the Earth on their way to the
detector.

The $\nu_e$ data show no sign of oscillation, but there is a clear deficit of muon
neutrinos. The best fit to the data has $\Delta m_{\text{At}}^2 = 2.2\times10^{-3}\ \mathrm{eV}^2$, and like K2K has
$\sin^2 2\theta_{\mu 3} = 1$, where for $D/E_\nu < 10^3$ km/GeV the $\Delta m_{32}^2$ and $\Delta m_{31}^2$ oscillations
are combined into one $\Delta m_{\text{At}}^2$ oscillation. The absence of discernible $\nu_e \to \nu_e$ oscillations
in the data was the first indication of the smallness of $|U_{e3}|^2$, which again
implies that the $\Delta m_{\text{At}}^2$ oscillations are predominantly between $\nu_\mu$ and $\nu_\tau$.




20.6 Solar neutrinos 


The nuclear and thermal physics of the Sun is well understood. The solar neutrino
spectra predicted by the Standard Solar Model and shown in Fig. 20.5 may be
assumed with confidence.

Figure 20.4 Data from Super Kamiokande (Y. Fukuda et al. Phys. Rev. Lett.
82, 1562 (1998)). The ratio of measured events to expected events in the absence
of oscillations. The lines show the expected shape for $\nu_\mu \leftrightarrow \nu_\tau$ with $\Delta m_{\text{At}}^2 =
2.2\times10^{-3}\ \mathrm{eV}^2$ and $\sin^2(2\theta_{\mu 3}) = 1$. There is no significant $\nu_e$ oscillation
observed.


The first measurements of the spectra were made by R. Davis and his collaborators
in the deep Homestake mine in the U.S.A. (Davis 1964). The detection of the
neutrinos was made through the reaction

$$\nu_e + {}^{37}_{17}\mathrm{Cl} + 0.81\ \mathrm{MeV} \to e^- + {}^{37}_{18}\mathrm{Ar}.$$

The Super-Kamiokande detector also made measurements of the solar neutrino
flux with $E_\nu$ greater than about 6 MeV (Fukuda et al. 1996).

Because of the high energy threshold these measurements were blind to the
principal flux from the 'pp' reaction. The GALLEX (Italy) and SAGE (Russia)
experiments were designed to remedy this, and examine the pp flux through the
reaction (Hampel et al., 1999; Gavrin et al., 2003)

$$\nu_e + {}^{71}_{31}\mathrm{Ga} + 0.23\ \mathrm{MeV} \to {}^{71}_{32}\mathrm{Ge} + e^-.$$

Figure 20.5 The solar neutrino spectra predicted by the standard solar model.
Spectra for the pp chain are shown by solid lines and those for the CNO chain by
dashed lines. (See Bahcall, J. N. and Ulrich, R. K. (1988), Rev. Mod. Phys. 60,
297.)

The SNO (Sudbury Neutrino Observatory, Canada) is a heavy water detector. Neutrinos
with $E_\nu$ greater than about 5 MeV are detected through the reactions (Ahmad
et al. 2002)

$$\nu_e + {}^{2}\mathrm{D} + 1.44\ \mathrm{MeV} \to e^- + p + p,$$

$$\nu + {}^{2}\mathrm{D} + 2.22\ \mathrm{MeV} \to p + n + \nu.$$

The first of these reactions is a charged current interaction and can be initiated only
by an electron neutrino. The second is a neutral current interaction, initiated with
equal probability by an electron, muon or tau neutrino. The SNO experiment also
measured the reaction rate of elastic neutrino scattering from electrons,

$$\nu + e^- \to \nu + e^-.$$

Again, this reaction can be triggered by a neutrino of any type. Measurements can
be used to infer both the $\nu_e$ flux $\phi(\nu_e)$ and the total flux $\phi(\nu_e + \nu_\mu + \nu_\tau)$.

The early results from the Homestake detector gave a measured flux of only
about one third of that expected from the Standard Solar Model without oscillation.
Super Kamiokande, GALLEX and SAGE gave about half the expected rate. SNO
found that

$$\frac{\phi(\nu_e)}{\phi(\nu_e + \nu_\mu + \nu_\tau)} = 0.306 \pm 0.05.$$

The measured total neutrino flux was consistent with that expected from the Standard
Solar Model and clearly, since the Sun produces only electron neutrinos, many
have made the transition to $\nu_\mu$ and $\nu_\tau$.


20.7 Solar MSW effects 

We showed in Section 19.4 that plane wave neutrino mass eigenstates depended on
functions $f_i(z)$ that satisfied

$$i\frac{df_i}{dz} = \frac{m_i^2}{2E}\,f_i + V(z)\,U^*_{ej}U_{ei}\,f_j. \qquad (20.6)$$

The source of solar neutrinos is the central region of the Sun, where the Standard
Solar Model gives $V(0) = 7.6\times10^{-12}$ eV. Comparing this with $\Delta m_{21}^2/2E$, which
with the 'reference parameters' of Section 20.1 equals $3.5\times10^{-12}\,(10\ \mathrm{MeV}/E)$ eV, it
is clear that the interpretation of the data from solar neutrino experiments requires
a serious consideration of the MSW effect.

As a starting approximation we again neglect the small term $U_{e3}$. With $U_{e3} = 0$
the solution of (20.6) for $f_3(z)$ is

$$f_3(z) = e^{-i m_3^2 z/2E}\,f_3(0),$$

independent of $V(z)$. With $U_{e3} = 0$, and since the initial neutrino is an electron
neutrino, $f_3(0) = 0$ and it follows that $f_3(z)$ is zero for all $z$: it plays no part in
the oscillations. The approximation again reduces the analysis to a two-neutrino
phenomenon in $f_1(z)$ and $f_2(z)$. After some algebra it can be shown that the solar
neutrino data can be analysed with the equations

$$i\frac{df_e}{dz} = \frac{\Delta m_{21}^2}{4E}\left(-\cos(2\theta_{e2})\,f_e + \sin(2\theta_{e2})\,f_x\right) + V(z)\,f_e,$$

$$i\frac{df_x}{dz} = \frac{\Delta m_{21}^2}{4E}\left(\sin(2\theta_{e2})\,f_e + \cos(2\theta_{e2})\,f_x\right). \qquad (20.7)$$

Here $f_x = c_{\mu 3}f_\mu - s_{\mu 3}f_\tau$ is a combination of $f_\mu$ and $f_\tau$, and $V(z)$ is known from the Standard
Solar Model. The equations have to be integrated numerically.
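
A numerical integration of a two-component system of this type can be sketched with a fixed-step fourth-order Runge-Kutta scheme. Everything below is an illustrative assumption and not the method used in the solar analyses: `k` stands for the common prefactor multiplying the mixing terms in (20.7), `potential` is a user-supplied function for $V(z)$ in the same inverse-length units as `k`, and the evolution starts from a pure electron-neutrino state. With $V = 0$ the result can be compared with the vacuum formula $1 - \sin^2(2\theta)\sin^2(kz)$.

```python
import math


def evolve(theta, k, potential, z_max, steps=2000):
    """Integrate i df/dz = H(z) f by RK4 from f_e = 1, f_x = 0 at z = 0.

    Returns (|f_e|^2, |f_x|^2) at z = z_max.
    """
    c2, s2 = math.cos(2.0 * theta), math.sin(2.0 * theta)

    def rhs(z, fe, fx):
        # Right-hand sides df_e/dz and df_x/dz (note the overall factor -i).
        v = potential(z)
        dfe = -1j * (k * (-c2 * fe + s2 * fx) + v * fe)
        dfx = -1j * (k * (s2 * fe + c2 * fx))
        return dfe, dfx

    h = z_max / steps
    z, fe, fx = 0.0, 1.0 + 0j, 0j
    for _ in range(steps):
        k1 = rhs(z, fe, fx)
        k2 = rhs(z + h / 2, fe + h / 2 * k1[0], fx + h / 2 * k1[1])
        k3 = rhs(z + h / 2, fe + h / 2 * k2[0], fx + h / 2 * k2[1])
        k4 = rhs(z + h, fe + h * k3[0], fx + h * k3[1])
        fe += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        fx += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        z += h
    return abs(fe) ** 2, abs(fx) ** 2


# Dimensionless illustration: k = 1, theta = 0.56 rad, z_max = 2.
vacuum = evolve(0.56, 1.0, lambda z: 0.0, 2.0)
matter = evolve(0.56, 1.0, lambda z: 1.5 * math.exp(-z), 2.0)  # assumed falling profile
print("vacuum:", vacuum)
print("matter:", matter)
```

A realistic calculation would replace the assumed exponential profile by the electron density of a solar model and would average over the production region and the neutrino energy spectrum.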

All the solar neutrino data are consistent with the oscillation interpretation, and
analysis of the data gives $3\times10^{-5}\ \mathrm{eV}^2 < \Delta m_{21}^2 < 1.9\times10^{-4}\ \mathrm{eV}^2$, $30.2° < \theta_{e2} <
34.9°$ with high probability (95% confidence level). The best fit is with $\Delta m_{21}^2 =
6.9\times10^{-5}\ \mathrm{eV}^2$, $\theta_{e2} = 32°$.

The solar neutrino data give a tighter constraint on $\theta_{e2}$ than KamLAND. Also,
with the MSW effect, the solution of equations (20.7) depends on the sign of $\Delta m_{21}^2$.
It is found to be positive, as is indicated in Figure 20.1.


20.8 Future prospects 

There are several planned experiments that will make a more thorough investigation
of neutrino masses and mixing phenomena. Apart from the possibility of sterile
neutrinos, indications of which have not been confirmed, there is no evidence to
contradict the three-neutrino theory of Chapter 19. However, it can be seen from
the quality of the data presented in this chapter that the neutrino mass theory is not
as well established as other branches of the Standard Model. Within the theory,
experiments are planned to make more precise measurements of the $\Delta m_{ij}^2$ and the
parameters of the neutrino mixing matrix.

The principal focus of experimental activity is on the construction of muon
neutrino beams as in the K2K experiment. An advantage of accelerator-generated
neutrinos is the control that one has on the flux and energy distribution. K2K is
an ongoing experiment but by late 2006 the muon neutrino experiments CNGS
and MINOS (Main Injector Neutrino Oscillation Search) will be in operation. The
CNGS neutrinos are generated at CERN and detected at the Gran Sasso underground
laboratory in Italy. The MINOS beam is generated at Fermilab and detected
in the Soudan mine in Minnesota. Both experiments will look for evidence of the
rare $\nu_\mu \to \nu_e$ transition and for the expected $\nu_\mu \to \nu_\tau$ oscillations. If the theory of
Chapter 19 is not challenged it is expected that by 2010 we will have much tighter
bounds on both $\sin^2(2\theta_{e3})$ and $|\Delta m_{\text{At}}^2|$.

In the more distant future a new very high intensity proton accelerator will be
built at Tokai, Japan. The experiment T2K will take over from K2K with a neutrino
beam of much higher intensity. Detection at Super Kamiokande will give a base line
$D \approx 295$ km. An upgrade to higher intensity for MINOS is also planned with a new
experiment NO$\nu$A. By 2015 with T2K and NO$\nu$A it is expected that if $\sin^2(2\theta_{e3}) >
0.01$ then it will be detected. The MSW effect will influence these measurements and
the sign of $\Delta m_{32}^2$ could be established, and hence the mass ordering. If $\sin^2(2\theta_{e3})$ can
be measured then it is also possible to have a measurement of the CP violating phase
$\delta$. Figure 20.6 shows the transition probabilities $P_D(\nu_\mu \to \nu_e)$ and $P_D(\bar\nu_\mu \to \bar\nu_e)$
as a function of $E_\nu$ with the T2K baseline. $\delta$ is taken as 45° and the other parameters
are a plausible set. Although the probabilities are small, the particle and antiparticle
probabilities differ considerably (see Section 19.3).

Figure 20.6 The upper curve is $P_D(\nu_\mu \to \nu_e)$.
The lower curve is $P_D(\bar\nu_\mu \to \bar\nu_e)$.
The parameters are $\Delta m_{32}^2 = 2\times10^{-3}\ \mathrm{eV}^2$, $\Delta m_{21}^2 = 7\times10^{-5}\ \mathrm{eV}^2$,
$\cos\theta_{e2} = 0.84$, $\cos\theta_{\mu 3} = 1/\sqrt{2}$, $\sin\theta_{e3} = 0.05$, $\delta = \pi/4$, $D = 295$ km.
The MSW effect, which depends on the local geology, will be significant but
calculable. It is not included here.


Majorana fields were introduced in Section 6.6. If neutrino fields are Majorana,
then there is no distinction to be made between neutrinos and antineutrinos. As 
explained in Section 6.7, the smallness of neutrino masses makes the differences 
between Dirac and Majorana neutrinos difficult to discern experimentally. 

In this chapter we elaborate on the theory of Majorana neutrinos and show 
how they can be accommodated within the Standard Model. Finally we describe 
experiments on ‘double $\beta$ decay’ that may determine the nature of neutrinos.

21.1 Majorana neutrino fields 

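We shall denote left-handed and right-handed Majorana neutrino fields by $\nu_L(x)$
and $\nu_R(x)$. From (6.28) and (6.29), making the identifications

$b_{p+} = d_{p+}, \quad b_{p-} = d_{p-},$

we have for a Majorana neutrino field carrying mass $m$

$\nu_L = \frac{1}{\sqrt{V}} \sum_p \sqrt{\frac{m}{2E_p}} \left[ \left( b_{p+} e^{-\theta/2} |+\rangle + b_{p-} e^{\theta/2} |-\rangle \right) e^{i(\mathbf{p} \cdot \mathbf{r} - Et)} + \left( b^*_{p+} e^{\theta/2} |-\rangle - b^*_{p-} e^{-\theta/2} |+\rangle \right) e^{i(-\mathbf{p} \cdot \mathbf{r} + Et)} \right],$    (21.1)

$\nu_R = \frac{1}{\sqrt{V}} \sum_p \sqrt{\frac{m}{2E_p}} \left[ \left( b_{p+} e^{\theta/2} |+\rangle + b_{p-} e^{-\theta/2} |-\rangle \right) e^{i(\mathbf{p} \cdot \mathbf{r} - Et)} + \left( -b^*_{p+} e^{-\theta/2} |-\rangle + b^*_{p-} e^{\theta/2} |+\rangle \right) e^{i(-\mathbf{p} \cdot \mathbf{r} + Et)} \right].$    (21.2)

The fields $\nu_L(x)$ and $\nu_R(x)$ are not independent. It is easily shown, using Problem
6.5, that

$(i\sigma^2)|-\rangle^* = |+\rangle, \quad (i\sigma^2)|+\rangle^* = -|-\rangle,$

and then that

$\nu_R = (i\sigma^2)\nu_L^*$ and $\nu_L = -(i\sigma^2)\nu_R^*$.    (21.3)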

Thus either field may be derived from the other. As a consequence, only left-handed 
Majorana fields or only right-handed Majorana fields need appear in any theory. 
The charge conjugate field $\nu_L^c$ was defined in (7.11b) by

$\nu_L^c = -(i\sigma^2)\nu_R^*.$

But by the results above $-(i\sigma^2)\nu_R^* = \nu_L$, so that

$\nu_L^c = \nu_L.$    (21.4)

Thus the charge conjugate of a Majorana field is identical to the field. There is no 
room in the theory of Majorana neutrinos for a distinguishable antineutrino. For 
a given momentum, there are two basic particle states, which we may take to be 
one with helicity $+1/2$, the other with helicity $-1/2$. (In these respects, Majorana
neutrinos are somewhat similar to photons, but with photons having helicities $\pm 1$.)
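The relations (21.3) can be checked numerically for a single momentum mode. The sketch below is only an illustration: it assumes the reading of (21.1) and (21.2) in which the positive-helicity spinor carries $e^{-\theta/2}$ in $\nu_L$ and $e^{\theta/2}$ in $\nu_R$, takes the momentum along $Oz$ so that $|+\rangle = (1, 0)$ and $|-\rangle = (0, 1)$, and drops the common normalisation factor. The helper names (`mode_fields`, `apply`, and so on) are hypothetical and chosen for this example.

```python
import cmath
import math

# Helicity spinors for momentum along +z, and i*sigma^2 written out explicitly.
PLUS = (1 + 0j, 0j)      # |+>
MINUS = (0j, 1 + 0j)     # |->
I_SIGMA2 = ((0, 1), (-1, 0))


def apply(matrix, spinor):
    """Multiply a 2x2 matrix into a two-component spinor."""
    return tuple(sum(matrix[r][c] * spinor[c] for c in range(2)) for r in range(2))


def conj(spinor):
    """Complex conjugate of each spinor component."""
    return tuple(z.conjugate() for z in spinor)


def scale(factor, spinor):
    return tuple(factor * z for z in spinor)


def add(*spinors):
    return tuple(sum(parts) for parts in zip(*spinors))


def mode_fields(b_plus, b_minus, theta, phase):
    """One-mode contributions to nu_L and nu_R (assumed reading of 21.1, 21.2).

    `phase` stands for p.r - Et; the overall normalisation is omitted.
    """
    w = cmath.exp(1j * phase)
    small, large = math.exp(-theta / 2), math.exp(theta / 2)
    nu_l = add(
        scale(w, add(scale(b_plus * small, PLUS), scale(b_minus * large, MINUS))),
        scale(w.conjugate(), add(scale(b_plus.conjugate() * large, MINUS),
                                 scale(-b_minus.conjugate() * small, PLUS))),
    )
    nu_r = add(
        scale(w, add(scale(b_plus * large, PLUS), scale(b_minus * small, MINUS))),
        scale(w.conjugate(), add(scale(-b_plus.conjugate() * small, MINUS),
                                 scale(b_minus.conjugate() * large, PLUS))),
    )
    return nu_l, nu_r


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for x, y in zip(a, b))


nu_l, nu_r = mode_fields(0.3 + 0.4j, -0.2 + 0.7j, theta=0.9, phase=1.3)
print(close(nu_r, apply(I_SIGMA2, conj(nu_l))))              # nu_R = (i sigma^2) nu_L*
print(close(nu_l, scale(-1, apply(I_SIGMA2, conj(nu_r)))))   # nu_L = -(i sigma^2) nu_R*
```

The amplitudes, rapidity and phase passed to `mode_fields` are arbitrary test values; any other choice should satisfy the same two relations.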


21.2 Majorana Lagrangian density 

The Majorana field is constructed from solutions of the Dirac equation. We saw in 
Section 5.2 that the Lagrangian density for a free Dirac particle of mass m is 

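$\mathcal{L}_{Dirac} = i\psi_L^\dagger \tilde\sigma^\mu \partial_\mu \psi_L + i\psi_R^\dagger \sigma^\mu \partial_\mu \psi_R - m(\psi_L^\dagger \psi_R + \psi_R^\dagger \psi_L).$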


In the case of a Majorana field, $\nu_R$ is determined by $\nu_L$, and given by (21.3) above.
We choose to work with $\nu_L$, and therefore take the Majorana Lagrangian density to
be

$\mathcal{L}_M = \frac{1}{2} \left[ i\nu^\dagger \tilde\sigma^\mu \partial_\mu \nu + i(i\sigma^2\nu^*)^\dagger \sigma^\mu \partial_\mu (i\sigma^2\nu^*) - m\{\nu^\dagger (i\sigma^2)\nu^* + \nu^T(-i\sigma^2)\nu\} \right],$

where $\nu = \nu_L$. For the remainder of this chapter we shall drop the subscript L
for clarity of notation; $\nu$ is a two component left-handed neutrino field. We have
introduced a factor of 1/2 to compensate for double counting.

The second dynamical term in $\mathcal{L}_M$ is equivalent to the first (Problem 21.1), so
that the Lagrangian density may be written

$\mathcal{L}_M = i\nu^\dagger \tilde\sigma^\mu \partial_\mu \nu - \frac{m}{2}\{\nu^\dagger (i\sigma^2)\nu^* + \nu^T(-i\sigma^2)\nu\}.$    (21.5)


It is interesting and important to note that, with finite mass $m$ and with the
Majorana constraints, we lose the $U(1)$ symmetry that gave neutrino number
conservation in the Dirac case (Section 7.1). We shall see that with Majorana
neutrinos the overall lepton number is no longer conserved.

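Noting the factor 1/2 in the Lagrangian density, the Hamiltonian operator $H$ and
momentum operator $\mathbf{P}$ for Majorana neutrinos are (see Section 6.5)

$H = \frac{1}{2} \sum_{p,\varepsilon} (b^*_{p\varepsilon} b_{p\varepsilon} - b_{p\varepsilon} b^*_{p\varepsilon}) E_p = \sum_{p,\varepsilon} (b^*_{p\varepsilon} b_{p\varepsilon}) E_p,$

$\mathbf{P} = \frac{1}{2} \sum_{p,\varepsilon} (b^*_{p\varepsilon} b_{p\varepsilon} - b_{p\varepsilon} b^*_{p\varepsilon}) \mathbf{p} = \sum_{p,\varepsilon} (b^*_{p\varepsilon} b_{p\varepsilon}) \mathbf{p},$    (21.6)

where $\varepsilon = \pm 1$ is the helicity index.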


21.3 Majorana field equations 

A variation in the Majorana action yields the field equation 

$i\tilde\sigma^\mu \partial_\mu \nu = m(i\sigma^2)\nu^*.$

(Note that there are two contributions from the mass term in the Lagrangian density.) 

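In a frame $K'$ in which the Majorana neutrino is at rest, $p_i\nu' = -i\partial_i\nu' = 0$ ($i$ =
1, 2, 3), and the field equation reduces to

$i\,\partial\nu'/\partial t' = m(i\sigma^2)\nu'^*.$    (21.7)

It is easy to verify that this equation has two solutions of the form

$\nu_1' = b e^{-iEt'} \begin{pmatrix} 1 \\ 0 \end{pmatrix} + b^* e^{iEt'} \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ and $\nu_2' = b e^{-iEt'} \begin{pmatrix} 0 \\ 1 \end{pmatrix} - b^* e^{iEt'} \begin{pmatrix} 1 \\ 0 \end{pmatrix},$

with $E = m$.    (21.8)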
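As a quick check of the rest-frame solutions, the sketch below evaluates both sides of (21.7) numerically. It assumes the column-vector reading of (21.8) in which $\nu_1'$ has components $(b e^{-imt'}, b^* e^{imt'})$ and $\nu_2'$ has components $(-b^* e^{imt'}, b e^{-imt'})$; the time derivative is taken by a central difference, and the function names are hypothetical.

```python
import cmath

I_SIGMA2 = ((0, 1), (-1, 0))  # i*sigma^2 written out explicitly


def nu_rest(t, b, mass, which):
    """Rest-frame solutions nu'_1 and nu'_2 (assumed reading of 21.8), E = m."""
    pos = b * cmath.exp(-1j * mass * t)
    neg = b.conjugate() * cmath.exp(1j * mass * t)
    if which == 1:
        return (pos, neg)       # b e^{-imt} (1,0) + b* e^{imt} (0,1)
    return (-neg, pos)          # b e^{-imt} (0,1) - b* e^{imt} (1,0)


def residual(t, b, mass, which, h=1e-5):
    """Largest component of i d(nu)/dt - m (i sigma^2) nu*, by central difference."""
    ahead = nu_rest(t + h, b, mass, which)
    behind = nu_rest(t - h, b, mass, which)
    lhs = [1j * (p - q) / (2 * h) for p, q in zip(ahead, behind)]
    star = [z.conjugate() for z in nu_rest(t, b, mass, which)]
    rhs = [mass * sum(I_SIGMA2[r][c] * star[c] for c in range(2)) for r in range(2)]
    return max(abs(x - y) for x, y in zip(lhs, rhs))


for which in (1, 2):
    print(which, residual(t=0.7, b=0.6 - 0.8j, mass=1.5, which=which))
```

The residuals should be at the level of the finite-difference error, far below the size of the individual terms (of order $m|b|$).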


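We may then, as in Section 6.3, transform to a frame $K$ in which the Majorana
neutrino is moving with velocity $v > 0$ in the $Oz$ direction:

$\nu_1 = \mathbf{M}^{-1}\nu_1' = \begin{pmatrix} e^{-\theta/2} & 0 \\ 0 & e^{\theta/2} \end{pmatrix} \left[ b e^{-imt'} \begin{pmatrix} 1 \\ 0 \end{pmatrix} + b^* e^{imt'} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right] = b e^{-imt'} e^{-\theta/2} \begin{pmatrix} 1 \\ 0 \end{pmatrix} + b^* e^{imt'} e^{\theta/2} \begin{pmatrix} 0 \\ 1 \end{pmatrix}.$

Substituting $t' = t\cosh\theta - z\sinh\theta$, so that $mt' = Et - pz$ with $E = m\cosh\theta$ and $p = m\sinh\theta$,

$\nu_1 = b e^{-\theta/2} \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{i(pz - Et)} + b^* e^{\theta/2} \begin{pmatrix} 0 \\ 1 \end{pmatrix} e^{i(-pz + Et)}.$    (21.9)

Similarly there are solutions of the form

$\nu_2 = b e^{\theta/2} \begin{pmatrix} 0 \\ 1 \end{pmatrix} e^{i(pz - Et)} - b^* e^{-\theta/2} \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{i(-pz + Et)}.$    (21.10)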


All other plane wave solutions may be generated from these by rotations, and 
we recover the general field (21.1). 


21.4 Majorana neutrinos: mixing and oscillations 

The most general Lorentz invariant Majorana mass term that can be introduced into 
a Lagrangian density is 

$\mathcal{L}^M_{mass} = -\frac{1}{2} \sum_{\alpha,\beta} \nu_\alpha^T(-i\sigma^2)\nu_\beta\, m_{\alpha\beta}$ + Hermitian conjugate.    (21.11)

$\alpha$ and $\beta$ run over the three neutrino types, $e$, $\mu$ and $\tau$; $\nu_\alpha$, $\nu_\beta$ are left-handed Majorana
fields; $m_{\alpha\beta}$ is an arbitrary complex matrix. In contrast to the case of Dirac
neutrinos, $m_{\alpha\beta}$ can be taken to be symmetric. This is because fermion fields
anticommute, so that $\nu_\alpha^T(-i\sigma^2)\nu_\beta$ is symmetric on the interchange of $\alpha$ and $\beta$ (see
Problem 21.2).

A general symmetric complex matrix can be transformed into a real diagonal 
matrix with positive diagonal elements by means of a single unitary matrix U (see, 
for example, Horn and Johnson (1985)). If $m_{\alpha\beta} = m_{\beta\alpha}$, we can write

$m_{\alpha\beta} = \sum_{i=1}^{3} U_{\alpha i}\, m_i\, U_{\beta i},$    (21.12)

where the $m_i$ are three positive masses. Note that U has no phase ambiguities,
whereas Dirac neutrinos have phase ambiguities (see (19.2)).

If we now define the fields 


$\nu_i(x) = \sum_\alpha U_{\alpha i}\, \nu_\alpha(x),$

the mass term takes the standard Majorana form:

$\mathcal{L}_{mass} = -\frac{1}{2} \sum_i m_i\, \nu_i^T(-i\sigma^2)\nu_i$ + Hermitian conjugate.    (21.13)

The dynamical terms in the Lagrangian density keep the same form under the
transformation:

$\mathcal{L}_{dyn} = \sum_\alpha i\nu_\alpha^\dagger \tilde\sigma^\mu \partial_\mu \nu_\alpha = \sum_i i\nu_i^\dagger \tilde\sigma^\mu \partial_\mu \nu_i.$

$(\mathcal{L}_{dyn} + \mathcal{L}_{mass})$ is the Lagrangian density of free Majorana neutrinos of masses
$m_1$, $m_2$, $m_3$. Inverting equation (21.13), the neutrino fields $\nu_\alpha(x)$ appear as
mixtures of the neutrino fields of definite mass:



$\nu_\alpha(x) = \sum_i U^*_{\alpha i}\, \nu_i(x).$    (21.14)


This is of the same form as equation (19.6) for Dirac neutrinos. The consequences 
for the weak currents and neutrino oscillations are the same as in Section 19.2 and 
Section 19.3 for Dirac neutrinos but antineutrinos are interpreted as the neutrinos 
that accompany a negative charge lepton in weak interaction decays. 

21.5 Parameterisation of U 

A $3 \times 3$ unitary matrix U is specified by nine real parameters, but by absorbing
phase factors into the definition of the lepton fields, as in Section 19.6, $U_{\alpha i}$ can be
redefined as



without changing the physical content of the theory. Thus U can be characterised
by 9 - 3 = 6 parameters. The Dirac neutrino mixing matrix (Section 19.6) is determined
by four parameters, and requires extension to include two more parameters.
One may take 



(21.15) 


Potentially we have two more CP violating parameters. However, these two
additional phases make no contribution to the CP violation of the oscillation
phenomena of Chapters 19 and 20 (see (19.19) and Problem 21.3).


21.6 Majorana neutrinos in the Standard Model 


To bring Majorana neutrinos carrying mass into the Standard Model, we must
maintain the $SU(2)$ symmetry of the weak interaction. As in the case of Dirac
neutrinos, a suitable $SU(2)$ invariant expression that we can construct from the
Higgs doublet field $\Phi$ and a lepton doublet $L_\alpha$ is $(\Phi^T \varepsilon L_\alpha)$ (see Section 19.5). On
symmetry breaking, this becomes $(\Phi^T \varepsilon L_\alpha) = -(\phi_0 + h/\sqrt{2})\nu_\alpha$.

$\phi_0 \approx 180$ GeV is the Higgs field vacuum expectation value and $h(x)$ is the Higgs
boson field.

From these $SU(2)$ invariant expressions we can construct an $SU(2)$ invariant
Lagrangian density that on symmetry breaking becomes

$\mathcal{L}_{mass} = -\frac{1}{2}(\phi_0 + h/\sqrt{2})^2\, \nu_\alpha^T(-i\sigma^2)\nu_\beta\, K_{\alpha\beta}$ + Hermitian conjugate.    (21.16)

The matrix $K_{\alpha\beta}$ couples the neutrino fields to the Higgs field, and we can identify
the mass term

$m_{\alpha\beta} = \phi_0^2 K_{\alpha\beta}.$    (21.17)

Hence the coupling matrix K has dimension (mass)$^{-1}$, which implies (see Section
8.4) that this is an ‘effective’ Lagrangian density. Coupling terms such as this render
the theory unrenormalisable.


21.7 The seesaw mechanism 

To address the question of renormalisability, consider the Lagrangian density

$\mathcal{L} = i\nu_L^\dagger \tilde\sigma^\mu \partial_\mu \nu_L + iR^\dagger \sigma^\mu \partial_\mu R - \frac{M}{2}\left(R^T(i\sigma^2)R - R^\dagger(i\sigma^2)R^*\right) - \mu\nu_L^\dagger R - \mu R^\dagger \nu_L.$    (21.18)

$M$ and $\mu$ are mass parameters; $\nu_L$ and $R$ are two component left-handed and
right-handed spinor fields respectively. Discarding the terms coupling $\nu_L$ and $R$, the
Lagrangian density is that of a massless left-handed neutrino field $\nu_L$, and a
right-handed Majorana neutrino field carrying mass $M$.

We now suppose that $M$ is so large that the dynamical term $iR^\dagger \sigma^\mu \partial_\mu R$ may be
neglected, to leave

$\mathcal{L} = i\nu_L^\dagger \tilde\sigma^\mu \partial_\mu \nu_L - \frac{M}{2}\left(R^T(i\sigma^2)R - R^\dagger(i\sigma^2)R^*\right) - \mu\nu_L^\dagger R - \mu R^\dagger \nu_L.$    (21.19)

A variation $\delta R^*$ in the action gives the field equation for $R$:

$M(i\sigma^2)R^* - \mu\nu_L = 0.$

Multiplying by $i\sigma^2/M$, using $(i\sigma^2)^2 = -1$ and taking the complex conjugate, we obtain

$R = -(\mu/M)\, i\sigma^2 \nu_L^*.$    (21.20)

Substituting back into (21.19) gives the effective Lagrangian density

$\mathcal{L} = i\nu_L^\dagger \tilde\sigma^\mu \partial_\mu \nu_L + (\mu^2/2M)\left(\nu_L^\dagger\, i\sigma^2 \nu_L^* + \nu_L^T(-i\sigma^2)\nu_L\right).$    (21.21)

The sign of the mass term can be changed by making the phase change $\nu_L \to
\nu_L' = i\nu_L$. The effective $\mathcal{L}$ is then that of a free neutrino field of mass $m = \mu^2/M$. Taking
for $\mu$ a typical lepton mass, say the mass of the muon ($10^2$ MeV), we can make
$m$ the magnitude of a neutrino mass by taking $M$ sufficiently large, $> 10^7$ GeV.
The generalisation of the seesaw mechanism to include three neutrino types is
straightforward.
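The orders of magnitude quoted above follow directly from $m = \mu^2/M$. The short sketch below does the unit bookkeeping; the function names are hypothetical, and the input $\mu = 0.1$ GeV is the rounded muon-scale value used in the text rather than a measured mass.

```python
GEV_TO_EV = 1.0e9


def seesaw_mass_ev(mu_gev, heavy_mass_gev):
    """Light neutrino mass m = mu^2 / M, returned in eV."""
    return mu_gev ** 2 / heavy_mass_gev * GEV_TO_EV


def heavy_mass_needed_gev(mu_gev, target_mass_ev):
    """Heavy mass M = mu^2 / m needed to reach a given light mass."""
    return mu_gev ** 2 / (target_mass_ev / GEV_TO_EV)


# mu ~ 10^2 MeV = 0.1 GeV; a few illustrative values of M.
for heavy in (1.0e7, 1.0e9, 1.0e11):
    print(f"M = {heavy:.0e} GeV  ->  m = {seesaw_mass_ev(0.1, heavy):.3g} eV")
```

With $\mu = 0.1$ GeV, $M = 10^7$ GeV corresponds to $m$ of about 1 eV, and each further factor of 100 in $M$ lowers $m$ by the same factor.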

Taking $R$ to be an $SU(2)$ singlet, the Lagrangian density (21.19) can be made
compatible with the Standard Model by replacing $\mu\nu_L^\dagger R$ with the $SU(2)$ invariant
$C(L^\dagger \Phi)R$, and similarly replacing $\mu R^\dagger \nu_L$, where $C$ is a dimensionless coupling
constant. After symmetry breaking, $\mu\nu_L^\dagger R$ becomes $C(\phi_0 + h(x)/\sqrt{2})\nu_L^\dagger R$ and,
setting aside the coupling to the Higgs boson, the mass $\mu = C\phi_0$. It should be
noted, though, that although there are no dimensioned coupling constants, the mass
$M$ is not generated by the Higgs mechanism.


21.8 Are neutrinos Dirac or Majorana? 

The principal feature that distinguishes massive Majorana neutrinos from massive
Dirac neutrinos is that Majorana neutrinos do not conserve lepton number. As
pointed out in Section 21.2, in the Majorana case the $U(1)$ symmetry that gives
lepton number conservation in the Dirac case is lost. The experimental observation
of a lepton number violating process would therefore be of great interest. ‘Double
$\beta$ decay’ is the most promising phenomenon for investigation.

The first direct laboratory observation of double $\beta$ decay was made in 1987, with
the decay

$^{82}_{34}\mathrm{Se} \to {}^{82}_{36}\mathrm{Kr} + e^- + e^- + \bar\nu_e + \bar\nu_e + 3.03$ MeV.

The mean lifetime for this decay has been measured to be $(9.2 \pm 1) \times 10^{19}$ yrs.

If neutrinos are Dirac particles, $\bar\nu_e$ is the appropriate symbol in this decay.
If neutrinos are Majorana particles, $\nu$ and $\bar\nu$ are identical. The observed decay
does not distinguish between the two interpretations. The process is illustrated in
Fig. 21.1a. An electron and a $\bar\nu$ in the Dirac case, or a $\nu$ in the Majorana case, are
created at each interaction point at which a d quark is transformed into a u quark.
The nucleus becomes $^{82}_{35}$Br, possibly in an excited state, between the interaction
points.

If neutrinos are Majorana, the decay might be a neutrinoless double $\beta$ decay, as
envisaged in Fig. 21.1b. The neutrino created at $X_1$ is annihilated at $X_2$, giving a
change of 2 in lepton number. This process is not available if neutrinos are Dirac
particles. In the absence of neutrinos to share the energy, the sum of the energies of

Figure 21.1 (a) Illustrates the two neutrino double $\beta$ decay of $^{82}_{34}$Se. The decay
occurs at the second order of perturbation theory in the weak interaction and
involves a sum over many states of $^{82}_{35}$Br (denoted by $^{82}_{35}$Br$^*$).

(b) Illustrates the neutrinoless double $\beta$ decay: a Majorana neutrino created in the
transition $^{82}_{34}$Se $\to$ $^{82}_{35}$Br$^*$ is annihilated in the transition $^{82}_{35}$Br$^*$ $\to$ $^{82}_{36}$Kr. In
perturbation theory this involves a sum over all momentum states of the neutrino as well
as many states of $^{82}_{35}$Br.


the two electrons emitted would be sharply peaked at the decay energy. (The recoil 
energy of the nucleus would be small.) 

Double $\beta$ decay and neutrinoless double $\beta$ decay occur at the second order
of perturbation theory in the effective weak interaction of equation (14.22). For
Majorana neutrinos, double $\beta$ decay and neutrinoless double $\beta$ decay are competing
processes. Neutrinoless decays are heavily suppressed. From the field equation

Table 21.1. From Elliott and Vogel, hep-ph/0202264 (February 2002)

Nucleus      Measured 2$\nu$ half life          Estimated 0$\nu$ half life         Measured 0$\nu$ half life
             $t_{1/2}$ (years)                $T^{0\nu}_{1/2}$ (years)           lower limit (years)

$^{48}$Ca      $(4.2 \pm 1.2) \times 10^{19}$      $(2.2 \pm 1.3) \times 10^{25}$      $> 9.5 \times 10^{21}$
$^{76}$Ge      $(1.3 \pm 0.1) \times 10^{21}$      $(3.2 \pm 2.4) \times 10^{25}$      $> 1.9 \times 10^{25}$
$^{82}$Se      $(9.2 \pm 1.0) \times 10^{19}$      $(1.3 \pm 1.0) \times 10^{25}$      $> 2.7 \times 10^{22}$
$^{100}$Mo     $(8.0 \pm 0.6) \times 10^{18}$      $(8.4 \pm 7.2) \times 10^{26}$      $> 5.5 \times 10^{22}$
$^{116}$Cd     $(3.2 \pm 0.3) \times 10^{19}$      $(1.0 \pm 0.9) \times 10^{25}$      $> 7.0 \times 10^{22}$


(21.1), the decay amplitude for the neutrinoless mode, with an intermediate neutrino
of mass $m_i$ and energy $E_\nu$, is proportional to

$(m_i/2E_\nu)\left[e^{-\theta/2}e^{\theta/2} + e^{\theta/2}e^{-\theta/2}\right] = m_i/E_\nu.$

The two terms come from the two helicity states. The corresponding factors in two
neutrino double $\beta$ decay are dominated by the term $(m_i/2E_\nu)e^{\theta}$, and $e^{\theta} \approx 2\cosh\theta =
(2E_\nu/m_i)$, giving unity.
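The size of this suppression is easy to tabulate. The sketch below evaluates the two helicity factors for a given mass and energy, using $E_\nu = m_i\cosh\theta$; the function name and the sample values (a 1 eV neutrino with 1 MeV energy) are illustrative assumptions, not numbers taken from the text.

```python
import math


def helicity_factors(mass, energy):
    """Return (neutrinoless factor, two-neutrino factor) for E = m cosh(theta).

    The neutrinoless factor is (m/2E)[e^{-t/2} e^{t/2} + e^{t/2} e^{-t/2}] = m/E;
    the two-neutrino factor is the dominant term (m/2E) e^{t}, which tends to 1.
    """
    theta = math.acosh(energy / mass)
    prefactor = mass / (2.0 * energy)
    neutrinoless = prefactor * (math.exp(-theta / 2) * math.exp(theta / 2)
                                + math.exp(theta / 2) * math.exp(-theta / 2))
    two_neutrino = prefactor * math.exp(theta)
    return neutrinoless, two_neutrino


# Illustrative values in eV: m = 1 eV, E = 1 MeV.
zero_nu, two_nu = helicity_factors(mass=1.0, energy=1.0e6)
print(zero_nu, two_nu, (zero_nu / two_nu) ** 2)
```

For these inputs the amplitude ratio is about $10^{-6}$, so the rate is suppressed by roughly $(m_i/E_\nu)^2 \sim 10^{-12}$ relative to a process without the helicity mismatch.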

With three neutrino mass eigenstates the decay rate will be proportional to
$(1/E_\nu^2)\,|\sum_i m_i U_{ei}^2|^2$ where $E_\nu$ is some mean neutrino energy that can be expected
to be a nuclear excitation energy.

Table 21.1 gives some measured two neutrino double $\beta$ decay half lives, and
corresponding estimates of the half lives of the neutrinoless decays. These theoretical
estimates are sensitive to the nuclear model used.


Problems 

21.1 Show that $(i\sigma^2\nu^*)^\dagger \sigma^\mu \partial_\mu (i\sigma^2\nu^*) = \nu^\dagger \tilde\sigma^\mu \partial_\mu \nu$.

21.2 Show that, taking account of the anticommuting spinor fields,

$\nu_\alpha^T \sigma^2 \nu_\beta = \nu_\beta^T \sigma^2 \nu_\alpha.$

21.3 Denoting the Majorana and Dirac mixing matrices by $U^M$ and $U^D$, show that
$U^M_{\beta j} U^{M*}_{\alpha j} = U^D_{\beta j} U^{D*}_{\alpha j}$ and hence that the phenomenology of mixing is the same for
both Majorana and Dirac neutrinos.

Anomalies


In the Standard Model, the fermion fields of the leptons and quarks interact through 
the mediation of vector bosons. As we remarked in Chapter 10, the renormalisability 
of the Model requires the vector boson fields to be introduced through the mecha¬ 
nisms of local gauge symmetry. Renormalisation requires the insertion of counter 
terms in the Lagrangian (Chapter 8). It is important that the counter terms maintain 
the local gauge symmetries, along with their corresponding conserved currents. As 
a consequence, one of the global current conservation laws of the Standard Model, 
that we have obtained by treating the fields as classical fields, has to be modified 
when the classical fields are quantised. This is an example of an anomaly. We shall 
see that baryon number and lepton number are not strictly conserved quantities in 
quantum field theory. 


22.1 The Adler-Bell-Jackiw anomaly 


Bell and Jackiw and, independently, Adler were the first to find an anomaly in a field 
theory (see Treiman et al ., 1985). They were concerned with the axial vector current 
associated with the chiral symmetries introduced in Section 16.7. To appreciate the 
nature of this anomaly, consider the model Lagrangian density 



( 22 . 1 ) 


This has the local gauge symmetry of electromagnetism; it is invariant under the 
transformation 


$\psi(x) \to \psi'(x) = e^{-iq\chi(x)}\psi(x),$
$A_\mu(x) \to A'_\mu(x) = A_\mu(x) + \partial_\mu \chi(x).$    (22.2)

If $m = 0$, $\mathcal{L}$ also has a global chiral symmetry: it is then invariant under the
transformation

$\psi(x) \to \psi'(x) = e^{i\alpha\gamma^5}\psi(x),$    (22.3)

as may easily be verified using the properties of the y matrices (Section 5.5). 

Applying the transformation (22.3) to the Lagrangian density (22.1), with $\alpha$ taken
to be infinitesimal and space and time dependent, gives an infinitesimal change $\delta\mathcal{L}$
in $\mathcal{L}$ which (after an integration by parts in the action) may be taken to be

$\delta\mathcal{L} = \alpha(x)\left[\partial_\mu J_A^\mu - 2im\bar\psi\gamma^5\psi\right],$

where

$J_A^\mu = \bar\psi\gamma^\mu\gamma^5\psi$    (22.4)

is the axial current. (See Problem 5.6.)

It follows from Hamilton’s principle that, for fields that obey the field
equations,

$\partial_\mu J_A^\mu = 2im\bar\psi\gamma^5\psi.$    (22.5)

If $m = 0$, the axial current is conserved:

$\partial_\mu J_A^\mu = 0$ if $m = 0$.    (22.6)

The results (22.5) and (22.6) have been obtained treating the fields as classical fields. 
In quantum field theory the fields become quantum operators, and the currents can be 
calculated in perturbation theory. It is found that in order to keep the electric charge 
conserved and maintain electromagnetism as a local gauge symmetry, perturbation 
theory requires 

$\partial_\mu J_A^\mu = 2im\bar\psi\gamma^5\psi - \frac{e^2}{4\pi^2}\varepsilon^{\mu\nu\lambda\rho}\partial_\mu A_\nu \partial_\lambda A_\rho.$    (22.7)

With $m = 0$ the axial current is not conserved, but instead

$\partial_\mu J_A^\mu = -\frac{e^2}{4\pi^2}\varepsilon^{\mu\nu\lambda\rho}\partial_\mu A_\nu \partial_\lambda A_\rho.$    (22.8)

This is the Adler-Bell-Jackiw axial anomaly. It is found to be the only anomalous
term in $\partial_\mu J_A^\mu$. Using Problem 4.3, we can write (22.8) in the explicitly gauge
invariant form

$\partial_\mu J_A^\mu = \frac{e^2}{2\pi^2}\mathbf{E} \cdot \mathbf{B}.$    (22.9)

It is interesting to note that from (22.8) we can construct a current

$J^\mu_{total} = J_A^\mu + \frac{e^2}{4\pi^2}\varepsilon^{\mu\nu\lambda\rho}A_\nu \partial_\lambda A_\rho$    (22.10)

which evidently is conserved:

$\partial_\mu J^\mu_{total} = 0.$    (22.11)

$J^\mu_{total}$ is gauge dependent (it contains $A_\nu$) and hence lacks immediate physical
significance. Nevertheless it follows from (22.11) that the charge

$Q(t) = \int J^0_{total}\, d^3x$    (22.12)

is constant in time. $Q(t)$ is a gauge invariant quantity.


22.2 Cancellation of anomalies in electroweak currents 

In the Standard Model, there are anomalies that have an origin and structure similar 
to the axial anomaly described in Section 22.1. In particular in the electroweak 
sector the gauge bosons couple to currents that have both vector and axial vector 
components, as, for example, in (12.15) where 

$j^\mu = e_L^\dagger \tilde\sigma^\mu \nu_{eL} = \bar e\gamma^\mu (1/2)(1 - \gamma^5)\nu_e.$    (22.13)

It is the mix of vector and axial vector that gives rise to anomalies that threaten 
the renormalisability of the electroweak sector. Detailed calculations show that, in 
a theory that has only leptons and no quarks, anomalies do spoil the conservation 
laws of the currents that couple to the bosons. Conversely, in a theory with only 
quarks and no leptons there are again anomalies. Remarkably, in a theory which 
includes both leptons and quarks the anomalies cancel exactly, provided that the 
number of lepton families is equal to the number of quark families, and then the 
electroweak gauge currents are strictly conserved (’t Hooft, 1976). Thus equality in
the number of lepton families and quark families is of fundamental importance to 
the renormalisability of the Standard Model. 

There are no serious anomalies associated with the gluon fields of the strong 
interaction. 


22.3 Lepton and baryon anomalies 

We now turn to the currents that, classically, arise from global symmetries and 
conserve the number of leptons and the number of quarks. We will first consider 
the situation if neutrinos are shown to be Dirac fermions. For Dirac neutrinos there 
is a conserved lepton current given by (22.25) 

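$J^\mu_{lepton}(x) = \sum_{\alpha = e,\mu,\tau} \left[\alpha_L^\dagger(x)\tilde\sigma^\mu \alpha_L(x) + \alpha_R^\dagger(x)\sigma^\mu \alpha_R(x) + \nu_{\alpha L}^\dagger(x)\tilde\sigma^\mu \nu_{\alpha L}(x) + \nu_{\alpha R}^\dagger(x)\sigma^\mu \nu_{\alpha R}(x)\right],$    (22.14)

and classically

$\partial_\mu J^\mu_{lepton} = 0.$    (22.15)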


On quantisation, this current is not conserved. The divergence equation has to 
be modified in a way reminiscent of (22.8) and becomes 



The fields $W_{\mu\nu}$, $B_{\mu\nu}$, and the coupling constants $g_1$ and $g_2$, were introduced in
Chapter 11.

The total quark number is also classically conserved but the same anomalous term
as in (22.16) arises when the quark fields are quantised for each colour. Summing
over the three colours we have



(22.17) 


Since baryon number is one third of the quark number, this can also be written

$\partial_\mu J^\mu_{baryon} = \partial_\mu J^\mu_{lepton},$    (22.18)

where $J^\mu_{lepton} = J^\mu_e + J^\mu_{muon} + J^\mu_{tau}$.

Thus if neutrinos are Dirac particles, anomalies reduce the two classically
conserved currents of the Standard Model to one, which can be taken as $J^\mu_{baryon} - J^\mu_{lepton}$.
The independent current $J^\mu_{baryon} + J^\mu_{lepton}$ is not conserved.

Let us now consider the lepton number current. This is not conserved but, as we 
found with the chiral anomaly, there is nevertheless an associated current that is 
conserved, and we may write 



(22.19) 


where 


Jt = ^i^ VXp T^ Tr (W.W* - fe/3) W y W A W p ) - g\B v B Xp 


( 22 . 20 ) 


$J_T^\mu$ is called the topological current, and



( 22 . 21 ) 


is the topological number. 

The lepton number is defined to be 



(22.22)

and it follows from (22.19) that $N_{lepton} - N_T$ is constant in time. If $N_T$ changes by
$\Delta N_T$, then $N_{lepton}$ changes by $\Delta N_{lepton}$, and $\Delta N_{lepton} = \Delta N_T$.

22.4 Gauge transformations and the topological number 

Is the topological number a gauge invariant? For simplicity we shall restrict our 
discussion to fields that are gauge transforms of the vacuum field configuration. 
Then from (11.4b) and (11.6) 

$B_\mu = (2/g_1)\,\partial_\mu \theta,$    (22.23)

$W_\mu = (2i/g_2)(\partial_\mu U)U^\dagger.$    (22.24)

The field strengths $B_{\mu\nu}$ and $W_{\mu\nu}$ are of course zero everywhere. Also we shall only
consider gauge transformations in a local region of space, so that $\theta \to 0$ and $U \to I$
as $r \to \infty$. The topological number for this vacuum configuration is

$N_T = -\frac{1}{8\pi^2}\int \varepsilon^{0ijk}\, \mathrm{Tr}\{(\partial_i U)U^\dagger (\partial_j U)U^\dagger (\partial_k U)U^\dagger\}\, d^3x,$    (22.25)

using (22.24) in (22.20).

It can be shown that $N_T$ is an integer multiple of 3: 0, ±3, ±6, ... We can
illustrate this by considering unitary transformations of the form

$U(\mathbf{x}) = \cos f(r)\, I + i\sin f(r)\, (\hat{\mathbf{r}} \cdot \boldsymbol\tau),$    (22.26)

taking $\boldsymbol\alpha = f(r)\hat{\mathbf{r}}$ in (B.9). Here $f(r)$ is a function with the property that $f(r) \to 0$
as $r \to \infty$, so that $U \to I$ as $r \to \infty$. If $U(\mathbf{x})$ is to be defined at $r = 0$, then $\sin f(r)$
must vanish there (since $\hat{\mathbf{r}}$ is not defined at $r = 0$). Thus we require $f(0) = n\pi$ where
$n$ is an integer. Subject only to the boundary conditions at $r = 0$ and $r \to \infty$, $f(r)$
can be any continuous and differentiable function.

If $n = 0$, $f(r)$ can be deformed continuously to give $f(r) = 0$, $U = I$, for all $r$;
transformations like this are called ‘small’ unitary transformations. If $n \ne 0$ there
is no way in which $f(r)$ can be deformed continuously to give $U = I$ for all $r$; these
are ‘large’ unitary transformations. Direct computation of (22.25) with U of the
form (22.26) gives

$N_T = \frac{6}{\pi}\int_0^{n\pi} \sin^2 f\, df = 3n.$    (22.27)
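The integral in (22.27) is elementary, but a numerical evaluation makes the integer values explicit. The sketch below uses a midpoint rule; the function name and the number of steps are arbitrary choices made for this illustration.

```python
import math


def topological_number(n, steps=20000):
    """Evaluate (6/pi) * integral_0^{n pi} sin^2(f) df by the midpoint rule."""
    upper = n * math.pi
    h = upper / steps
    total = sum(math.sin((k + 0.5) * h) ** 2 for k in range(steps))
    return (6.0 / math.pi) * total * h


for n in (-2, -1, 0, 1, 2, 3):
    print(n, round(topological_number(n), 9))
```

The result depends only on the boundary value $f(0) = n\pi$, not on the shape of $f(r)$ in between, which is why $N_T$ labels topologically distinct configurations.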

It appears that in a theory with no fermions there would be many inequivalent
representations of the vacuum state, characterised by a topological number $N_T$.
Neglecting the fermions, and treating the $SU(2) \times U(1)$ gauge fields and the Higgs
field classically, it is found that to change $N_T$ continuously by one unit involves
field distortions that require energy. Estimates suggest the energy barrier in field
configurations is of height a few times $(4\pi/g_2^2)M_W \sim 100\,M_W$. Treating the fields
as quantum fields, ’t Hooft (1976) found that quantum tunnelling can take place
through the barrier, but the probability per unit volume in space-time of a change in
$N_T$ is very small because of a very small tunnelling factor $\exp(-16\pi^2/g_2^2) \sim 10^{-173}$.
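The quoted orders of magnitude can be reproduced with a few lines of arithmetic. The sketch below assumes the relation $e = g_2\sin\theta_W$ with $e^2 = 4\pi\alpha$, and uses $\alpha = 1/137.036$ and $\sin^2\theta_W = 0.2312$ as inputs; these numerical values and the function names are assumptions made for this illustration, and the exponent is sensitive to the value of $g_2^2$ used.

```python
import math

ALPHA = 1.0 / 137.036        # fine structure constant (assumed input)
SIN2_THETA_W = 0.2312        # sin^2 of the weak mixing angle (assumed input)


def g2_squared(alpha=ALPHA, sin2_theta_w=SIN2_THETA_W):
    """g_2^2 from e = g_2 sin(theta_W), with e^2 = 4 pi alpha."""
    e_squared = 4.0 * math.pi * alpha
    return e_squared / sin2_theta_w


def log10_tunnelling_factor(g2_sq):
    """log10 of exp(-16 pi^2 / g_2^2)."""
    return -16.0 * math.pi ** 2 / g2_sq / math.log(10.0)


def barrier_scale_in_mw(g2_sq):
    """The combination 4 pi / g_2^2, i.e. the barrier scale in units of M_W."""
    return 4.0 * math.pi / g2_sq


g2_sq = g2_squared()
print(f"g_2^2 = {g2_sq:.4f}")
print(f"4 pi / g_2^2 = {barrier_scale_in_mw(g2_sq):.1f}")
print(f"log10 of tunnelling factor = {log10_tunnelling_factor(g2_sq):.1f}")
```

With these inputs $4\pi/g_2^2$ is about 32, so "a few times" this scale is of order $100\,M_W$, and the tunnelling factor is close to $10^{-173}$.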

22.5 The instability of matter, and matter genesis 

Including the fermions in the Standard Model, if the Higgs and gauge fields 
pass over the energy barrier separating different topological sectors, the fermion 
fields must also evolve. Suppose, for example, that $\Delta N_{lepton} = -3$ and, from
(22.18), $\Delta N_{baryon} = -3$. These conditions are satisfied by, for example, the decay
$^3\mathrm{He} \to e^+ + \mu^+ + \bar\nu_\tau$.

With suppression factors like $10^{-173}$, it is unlikely that any helium nucleus in
our galaxy has ever decayed in this way since helium nuclei were formed.

It is nevertheless an intriguing possibility that the matter content of the Universe 
could have been generated by an anomaly mechanism. In the Big Bang model of 
cosmology, at the very early stage in its evolution the Universe was intensely hot, at 
a temperature high compared even with the barrier height separating the different 
topological sectors. Thermal fluctuations over the barrier would produce matter 
or antimatter depending on the sign of $\Delta N_T$. In the beginning the net baryon and
lepton numbers might both have taken the symmetrical value zero. To generate the 
observed preponderance of matter over antimatter requires CP violation, and this 
is an attribute of the Standard Model. 

The modifications are straightforward if neutrinos are Majorana fermions. For 
example, with the Majorana Lagrange density of (21.11), (22.19) becomes 



(22.28) 


as can be shown by making an infinitesimal, space time dependent, phase change
on all the lepton fields (see the method of Section 22.1). If neutrinos are Majorana
particles then, with the anomalies, no global conservation laws remain.

Reductionism complete?

The Standard Model, extended to include neutrinos carrying mass, gives a remark¬ 
ably successful account of the experimental data of particle physics obtained up 
to 2006. Any subsequent theory must, in some sense, correspond to the Standard 
Model in the energy range that has so far been explored. 

Many questions remain to be answered. Why is there the internal electroweak and
strong group structure $U(1) \times SU(2) \times SU(3)$, with the three coupling constants
$g_1$, $g_2$, $g_3$? Is the origin of mass really to be found in the Higgs field with its two
parameters: the Higgs mass and the expectation value of the Higgs field? In the 
electroweak sector, why are the masses of the charged leptons as they are? There 
are three parameters here. Another set of parameters comes with allowing neutrinos 
to have mass: three neutrino masses and four parameters of the mass mixing matrix 
(or six if it appears that neutrinos correspond to Majorana fields rather than Dirac 
fields). In the quark sector ten more parameters are introduced: six quark masses, 
and four parameters in the Kobayashi-Maskawa matrix. 

Are these twenty five or twenty six parameters really independent? 

Some of these questions may be answered when experimentalists have the LHC 
(Large Hadron Collider) at CERN, probing to higher energies and thereby to smaller 
distances to make progress into finding common origins of what are now diverse 
elements of the Standard Model. The task is to reduce twenty six parameters to one 
or two, say, before closing the book on the theory of matter and radiation.

An aide-memoire on matrices


A.1 Definitions and notation

An $m \times n$ matrix $A = (A_{ij})$; $i = 1, ..., m$; $j = 1, ..., n$, is an ordered array of $mn$
numbers, which may be complex:

$A = \begin{pmatrix} A_{11} & A_{12} & \cdots & A_{1n} \\ A_{21} & A_{22} & \cdots & \\ \vdots & & & \vdots \\ A_{m1} & \cdots & & A_{mn} \end{pmatrix}.$

$A_{ij}$ is the element of the $i$th row and $j$th column.

The complex conjugate of A, written $A^*$, is defined by

$A^* = (A^*_{ij}).$

The transpose of A, written $A^T$, is the $n \times m$ matrix defined by

$A^T_{ij} = A_{ji}.$

The Hermitian conjugate, or adjoint, of A, written $A^\dagger$, is defined by

$A^\dagger_{ij} = A^*_{ji}$, or equivalently by $A^\dagger = (A^T)^*$.

If $\lambda$, $\mu$ are complex numbers and A, B are $m \times n$ matrices, $C = \lambda A + \mu B$ is defined
by

$C_{ij} = \lambda A_{ij} + \mu B_{ij}.$

Multiplication of the $m \times n$ matrix A by an $n \times l$ matrix B is defined by AB = C,
where C is the $m \times l$ matrix given by

$C_{ik} = A_{ij}B_{jk}.$

We use the Einstein convention, that a repeated ‘dummy’ suffix is understood to be
summed over, so that

$A_{ij}B_{jk}$ means $\sum_{j=1}^{n} A_{ij}B_{jk}$.

Multiplication is associative: (AB)C = A(BC). It follows immediately from the
definitions that

$(AB)^* = A^*B^*, \quad (AB)^T = B^T A^T, \quad (AB)^\dagger = B^\dagger A^\dagger.$

Block multiplication: matrices may be subdivided into blocks and multiplied by a rule
similar to that for multiplication of elements, provided that the blocks are compatible. For
example,

$\begin{pmatrix} A & B \\ C & D \end{pmatrix} \begin{pmatrix} E \\ F \end{pmatrix} = \begin{pmatrix} AE + BF \\ CE + DF \end{pmatrix}$

provided that the $l_1$ columns of A and $l_2$ columns of B are matched by $l_1$ rows of E and $l_2$
rows of F. The proof follows from writing out the appropriate sums.


A.2 Properties of n x n matrices 


We now focus on ‘square’ $n \times n$ matrices. If A and B are $n \times n$ matrices, we can construct
both AB and BA. In general, matrix multiplication is non-commutative, i.e. in general,

$AB \ne BA.$

The $n \times n$ identity matrix or unit matrix I is defined by $I_{ij} = \delta_{ij}$, where $\delta_{ij}$ is the
Kronecker $\delta$:

$\delta_{ij} = 1$ if $i = j$, $\quad \delta_{ij} = 0$ if $i \ne j$.

From the rule for multiplication,

$IA = AI = A$

for any A. A is said to be diagonal if $A_{ij} = 0$ for $i \ne j$.

Determinants: with a square matrix A we can associate the determinant of A, denoted
by det A or $|A_{ij}|$, and defined by

$\det A = \varepsilon_{ij \ldots t} A_{1i} A_{2j} \ldots A_{nt}$

(remember the summation convention) where

$\varepsilon_{ij \ldots t} = 1$ if $i, j, \ldots, t$ is an even permutation of 1, 2, ..., n,
$\varepsilon_{ij \ldots t} = -1$ if $i, j, \ldots, t$ is an odd permutation of 1, 2, ..., n,
$\varepsilon_{ij \ldots t} = 0$ otherwise.

An important result is

$\det(AB) = \det A \det B.$

Note also

$\det A^T = \det A, \quad \det I = 1.$

If $\det A \ne 0$ the matrix A is said to be non-singular, and $\det A \ne 0$ is a necessary and
sufficient condition for a unique inverse $A^{-1}$ to exist, such that

$AA^{-1} = A^{-1}A = I.$

Evidently,

$(AB)^{-1} = B^{-1}A^{-1}.$

The trace of a matrix A, written Tr A, is the sum of its diagonal elements:

$\mathrm{Tr} A = A_{ii}.$

It follows from the definition that

$\mathrm{Tr}(AB) = A_{ij}B_{ji} = B_{ji}A_{ij} = \mathrm{Tr}(BA),$

and hence

$\mathrm{Tr}(ABC) = \mathrm{Tr}(BCA) = \mathrm{Tr}(CAB).$

A.3 Hermitian and unitary matrices 

Hermitian and unitary matrices are square matrices of particular importance in quantum 
mechanics. In a matrix formulation of quantum mechanics, dynamical observables are 
represented by Hermitian matrices, while the time development of a system is determined 
by a unitary matrix. 

A matrix H is Hermitian if it is equal to its Hermitian conjugate:

$H^\dagger = H$, or $H_{ij} = H^*_{ji}$.

The diagonal elements of a Hermitian matrix are therefore real, and an $n \times n$ Hermitian
matrix is specified by $n + 2n(n - 1)/2 = n^2$ real numbers.

A matrix U is unitary if

$U^{-1} = U^\dagger$, or $UU^\dagger = U^\dagger U = I$.

The product of two unitary matrices is also unitary. 

A unitary transformation of a matrix A is a transformation of the form 

$A \to A' = UAU^{-1} = UAU^\dagger,$

where U is a unitary matrix. The transformation preserves algebraic relationships:

$(AB)' = A'B',$

and Hermitian conjugation

$(A')^\dagger = UA^\dagger U^\dagger.$

Also

$\mathrm{Tr} A' = \mathrm{Tr} A, \quad \det A' = \det A.$

An important theorem of matrix algebra is that, for each Hermitian matrix H, there
exists a unitary matrix U such that

$H' = UHU^{-1} = UHU^\dagger = H_D$

is a real diagonal matrix.

A necessary and sufficient condition that Hermitian matrices $H_1$ and $H_2$ can be brought
into the diagonal form by the same unitary transformation is

$H_1H_2 - H_2H_1 = 0.$

It follows from this (see Problem A.3) that a matrix M can be brought into diagonal form
by a unitary transformation if and only if

$MM^\dagger - M^\dagger M = 0.$

Note that unitary matrices satisfy this condition.

An arbitrary matrix M which does not satisfy this condition can be brought into real
diagonal form by a generalised transformation involving two unitary matrices, $U_1$ and $U_2$
say, which may be chosen so that

$U_1 M U_2^\dagger = M_D$

is diagonal (see Problem A.4).

If H is a Hermitian matrix, the matrix 

$$U = \exp(iH)$$

is unitary. The right-hand side of this equation is to be understood as defined by the series 
expansion 

$$U = I + (iH) + (iH)^2/2! + \cdots.$$

Then 

$$U^\dagger = I + (-iH^\dagger) + (-iH^\dagger)^2/2! + \cdots = \exp(-iH^\dagger) = \exp(-iH) = U^{-1}$$

(the operation of Hermitian conjugation being carried out term by term). Conversely, any 
unitary matrix U can be expressed in this form. Since an n × n Hermitian matrix is 
specified by $n^2$ real numbers, it follows that a unitary matrix is specified by $n^2$ real 
numbers. 
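The statement that exp(iH) is unitary can be checked numerically. The sketch below is only an illustration: the 2 × 2 Hermitian matrix H and the helper names (`matmul`, `dagger`, `exp_series`) are assumptions made for this example, and the series is simply truncated after a fixed number of terms.

```python
def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(A):
    """Hermitian conjugate: transpose and complex-conjugate."""
    n = len(A)
    return [[A[j][i].conjugate() for j in range(n)] for i in range(n)]

def exp_series(A, terms=40):
    """Partial sum I + A + A^2/2! + ... containing `terms` terms in total."""
    n = len(A)
    result = [[complex(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]          # current term A^k / k!
    for k in range(1, terms):
        term = [[entry / k for entry in row] for row in matmul(term, A)]
        result = [[result[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    return result

# Illustrative Hermitian matrix: real diagonal, conjugate off-diagonal pair.
H = [[1.0, 0.5 - 0.25j], [0.5 + 0.25j, -0.3]]
U = exp_series([[1j * h for h in row] for row in H])

# Largest deviation of U U^dagger from the identity (expected at rounding level).
UUd = matmul(U, dagger(U))
max_dev = max(abs(UUd[i][j] - (i == j)) for i in range(2) for j in range(2))
print(max_dev)
```

Replacing H by a non-Hermitian matrix makes `max_dev` visibly non-zero, which is a quick way to see that Hermiticity of H is what the argument relies on.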


A.4 A Fierz transformation 

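It is easy to show that any 2 × 2 matrix M with complex elements may be expressed as a 
linear combination of the matrices $\sigma^\mu$: 

$$M = Z_\mu\sigma^\mu,$$

and $Z_\mu = \tfrac{1}{2}\mathrm{Tr}(\sigma^\mu M)$, since $\mathrm{Tr}(\sigma^\mu\sigma^\nu) = 2\delta_{\mu\nu}$. 

Consider the expression 

$g_{\mu\nu}\langle a^*|\sigma^\mu|b\rangle\langle c^*|\sigma^\nu|d\rangle$, where $|a\rangle, |b\rangle, |c\rangle, |d\rangle$ are two-component spinor fields. Using 
the result above, we can replace the matrix $|b\rangle\langle c^*|$ by 

$$|b\rangle\langle c^*| = \tfrac{1}{2}\mathrm{Tr}\left(\sigma^\lambda|b\rangle\langle c^*|\right)\sigma^\lambda = -\tfrac{1}{2}\langle c^*|\sigma^\lambda|b\rangle\,\sigma^\lambda.$$

The last step is evident on putting in the spinor indices, and the minus sign arises from 
the interchange of anticommuting spinor fields. 

We now have 

$$g_{\mu\nu}\langle a^*|\sigma^\mu|b\rangle\langle c^*|\sigma^\nu|d\rangle = -\tfrac{1}{2}g_{\mu\nu}\langle a^*|\sigma^\mu\sigma^\lambda\sigma^\nu|d\rangle\langle c^*|\sigma^\lambda|b\rangle.$$

Using the algebraic identity 

$$g_{\mu\nu}\sigma^\mu\sigma^\lambda\sigma^\nu = -2g_{\rho\lambda}\sigma^\rho,$$

gives $g_{\mu\nu}\langle a^*|\sigma^\mu|b\rangle\langle c^*|\sigma^\nu|d\rangle = g_{\rho\lambda}\langle a^*|\sigma^\rho|d\rangle\langle c^*|\sigma^\lambda|b\rangle$. 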

This is an example of a Fierz transformation. 
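The algebraic identity used in the last step, that contracting $\sigma^\mu\sigma^\lambda\sigma^\nu$ with the metric gives $-2g_{\rho\lambda}\sigma^\rho$, can be confirmed by brute force. The following is a minimal sketch assuming $\sigma^0 = I$, the usual Pauli matrices for $\sigma^k$, and the metric diag(1, −1, −1, −1); the names `SIGMA`, `G` and `identity_holds` are introduced here for illustration only.

```python
I2 = [[1, 0], [0, 1]]
S1 = [[0, 1], [1, 0]]
S2 = [[0, -1j], [1j, 0]]
S3 = [[1, 0], [0, -1]]
SIGMA = [I2, S1, S2, S3]        # sigma^mu for mu = 0, 1, 2, 3
G = [1, -1, -1, -1]             # diagonal entries of the metric g_{mu nu}

def matmul(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def identity_holds(lam, tol=1e-12):
    """Check sum_mu g_{mu mu} sigma^mu sigma^lam sigma^mu = -2 g_{lam lam} sigma^lam."""
    lhs = [[0, 0], [0, 0]]
    for mu in range(4):
        term = matmul(matmul(SIGMA[mu], SIGMA[lam]), SIGMA[mu])
        for i in range(2):
            for j in range(2):
                lhs[i][j] += G[mu] * term[i][j]
    rhs = [[-2 * G[lam] * SIGMA[lam][i][j] for j in range(2)] for i in range(2)]
    return all(abs(lhs[i][j] - rhs[i][j]) < tol
               for i in range(2) for j in range(2))

checks = [identity_holds(lam) for lam in range(4)]
print(checks)
```

Because the metric is diagonal, the double sum over μ and ν collapses to a single sum, which is what the loop implements.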
Problems 

A.1 Show that 

$$\varepsilon_{ij\ldots t}A_{\alpha i}A_{\beta j}\cdots A_{\nu t} = \varepsilon_{\alpha\beta\ldots\nu}\det A.$$

A.2 Show that if A, B are Hermitian, then i(AB − BA) is Hermitian. 

A.3 Show that an arbitrary square matrix M can be written in the form M = A + iB, 
where A and B are Hermitian matrices. Find A and B in terms of M and $M^\dagger$. Hence 
show that M may be put into diagonal form by a unitary transformation if and only 
if $MM^\dagger - M^\dagger M = 0$. 

A.4 If M is an arbitrary square matrix, show that $MM^\dagger$ is Hermitian and hence can be 
diagonalised by a unitary matrix $U_1$, so that we can write 

$$U_1(MM^\dagger)U_1^\dagger = M_D^2,$$

where $M_D$ is diagonal with real diagonal elements ≥ 0. Suppose none are zero. Define 
the Hermitian matrix $H = U_1^\dagger M_D U_1$. Show that $V = H^{-1}M$ is unitary. Hence show 
that 

$$M = U_1^\dagger M_D U_2,$$

where $U_2 = U_1V$ is a unitary matrix. 
Appendix B 

The groups of the Standard Model 


The Standard Model is constructed by insisting that the equations of the model retain the 
same form after certain transformations. For instance, we require that the equations take 
the same form in every inertial frame of reference, so that they are covariant under a 
Lorentz transformation; this may be a rotation of axes or a boost, or a combination of 
rotation and boost. The Lagrangian density that describes the Standard Model takes the 
same form in the new coordinate system, and the Lorentz transformation is said to be a 
symmetry transformation. In the Standard Model, as well as symmetries under coordinate 
transformations, there are ‘internal’ symmetries of the particle fields. The corresponding 
symmetry transformations are conveniently represented by matrices. 

It is characteristic of symmetry transformations that they satisfy the mathematical 
axioms of a group, which we set out below. In this appendix we consider some properties 
of the groups that play a special role in the Standard Model. 


B.1 Definition of a group 

A group G is a set of elements a, b, c, . . ., together with a rule that combines any two 
elements a, b of G to form an element ab, which also belongs to G, satisfying the 
following conditions. 

(i) The rule is associative: a(bc) = (ab)c. 

(ii) G contains a unique identity element I such that, for every element a of G, 

aI = Ia = a. 

(iii) For every element a of G there exists a unique inverse element $a^{-1}$ such that 

$aa^{-1} = a^{-1}a = I.$ 

If also ab = ba for all a, b the group is said to be commutative or Abelian. 

It is usually easy to determine whether or not a given set of elements and their 
combination law satisfy these axioms. For example, the set of all integers forms an 
Abelian group under addition, with 0 the identity element. The set of all non-singular 
n × n matrices (n > 1) forms a non-Abelian group under matrix multiplication. The 
permutations of the numbers 1, 2, . . ., n form a group which has n! elements; this is an 
example of a finite group. The group of rotations of the coordinate axes is a 
three-parameter continuous group: an element is specified by three parameters that take on 
a continuous range of values. We shall be concerned principally with groups of this type. 
B.2 Rotations of the coordinate axes, and the group SO(3) 

Consider a rotation of the coordinate axes about the origin. If the coordinates of a point P 
are $(x^1, x^2, x^3)$ in a frame of reference K, and $(x'^1, x'^2, x'^3)$ in a frame K', rotated relative 
to K, the $x'^i$ are related to the $x^i$ by a real linear transformation of the form 

$$x'^i = R^i{}_j x^j. \tag{B.1}$$

$R = (R^i{}_j)$ is the rotation matrix. For example, a rotation of the axes through an angle $\theta$ 
about the 03 axis in a right-handed sense is given by 

$$x'^1 = x^1\cos\theta + x^2\sin\theta, \qquad x'^2 = -x^1\sin\theta + x^2\cos\theta, \qquad x'^3 = x^3,$$

and corresponds to the matrix 

$$R_{03}(\theta) = \begin{pmatrix} \cos\theta & \sin\theta & 0 \\ -\sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}. \tag{B.2}$$

We may regard the $x'^i$ and $x^i$ as 3 × 1 (column) matrices x' and x, and write the 
transformation (B.1) as 

x' = Rx. 

The transpose $x^T$ of x is a 1 × 3 (row) matrix, and the scalar product of two vectors x and 
y is 

$$x^i y^i = x^T y = y^T x.$$

In particular, the length OP is given by $\sqrt{x^T x}$. Since a rotation of axes preserves scalar 
products, 

$$x'^T y' = x^T R^T R y = x^T y.$$

This holds for all pairs x, y. Hence 

$$R^T R = I, \tag{B.3}$$

where I is the identity matrix: hence the inverse of R is the transpose $R^T$ of R and R is 
said to be an orthogonal matrix. 

Since $\det R^T \det R = \det(R^T R) = \det I = 1$ and $\det R^T = \det R$, 

$$(\det R)^2 = 1, \qquad \det R = \pm 1. \tag{B.4}$$

Matrices corresponding to pure or ‘proper’ rotations have det R = +1. We can see this 
by noting that the identity rotation is a proper rotation, and det I = 1. Any proper rotation 
can be constructed as a sequence of infinitesimal rotations starting from I and hence by 
continuity also has determinant +1. 

The product of two orthogonal matrices is an orthogonal matrix, since 

$$(R_1R_2)^T = R_2^T R_1^T = R_2^{-1}R_1^{-1} = (R_1R_2)^{-1},$$

and if $\det R_1 = 1$ and $\det R_2 = 1$, 

$$\det(R_1R_2) = \det R_1 \det R_2 = 1.$$

Hence real orthogonal 3 × 3 matrices with det R = 1 form a group under matrix 
multiplication. This group is called the special orthogonal group and is denoted by SO(3). 

Orthogonal matrices with det R = −1 also preserve scalar products. It is easy to see 
that inversion of the coordinate axes in the origin, $x'^i = -x^i$, corresponds to an 
orthogonal matrix with determinant −1; a general ‘improper’ rotation corresponds to 
inversion in the origin together with a proper rotation. Improper rotation matrices do not 
form a group, since the product of two improper rotations is a proper rotation. 

A general proper rotation may be built up as a sequence of rotations about three 
different axes. For example, consider 

$$R(\psi, \theta, \phi) = R_{03''}(\psi)\,R_{02'}(\theta)\,R_{03}(\phi), \tag{B.5}$$

in an obvious notation. The direction of 03'' is defined by $\theta$ and $\phi$, and then $\psi$ defines the 
final orientation of 01''2'' in the plane perpendicular to 03''. Thus each element of SO(3) is 
specified by just three parameters. ($\psi$, $\theta$, $\phi$ are known as the Euler angles.) 

We can also interpret the transformation (B.1) in an active sense. Consider a system 
described by a wave function $\Psi(x)$ in the frame K. The system is described by 
$\Psi'(x') = \Psi(R^{-1}x')$ in the frame K'. This is the passive interpretation. We might, 
alternatively, drop the primes on the coordinates and give this equation an active 
interpretation, supposing that the axes have been held fixed and the system given the 
inverse rotation $R^{-1}$. The wave function of the rotated system is $\Psi'(x) = \Psi(R^{-1}x)$. 

B.3 The group SU(2) 

An n × n matrix U is unitary if $UU^\dagger = U^\dagger U = I$. The product of two unitary matrices is 
unitary. Hence n × n unitary matrices form a group under matrix multiplication, denoted 
by U(n). 

Since 

$$\det(UU^\dagger) = \det U \det U^\dagger = \det U\,(\det U)^* = \det I = 1,$$

we may write $\det U = e^{i\alpha}$, where $\alpha$ is real. 

The special unitary group SU(2) is the group of all 2 × 2 unitary matrices with 
determinant equal to 1. These form a group, since if $\det U_1 = 1$ and $\det U_2 = 1$ then 
$\det(U_1U_2) = \det U_1 \det U_2 = 1$. SU(2) is a sub-group of U(2). Every element of U(2) is 
the product of a phase factor $e^{i\alpha}$, which is an element of U(1), and an element of SU(2). 

The group SU(2) is related in a remarkable way to the rotation group SO(3) described 
in Section B.2. It is central to the electroweak sector of the Standard Model. 

Any element of U(2) can be put in the form 

$$U = \exp(iH)$$

where H is a Hermitian matrix (Appendix A). A general 2 × 2 Hermitian matrix may be 
taken as 

$$H = \begin{pmatrix} \alpha^0 + \alpha^3 & \alpha^1 - i\alpha^2 \\ \alpha^1 + i\alpha^2 & \alpha^0 - \alpha^3 \end{pmatrix},$$

where the $\alpha^\mu$ ($\mu$ = 0, 1, 2, 3) are four real parameters. This choice enables us to write 

$$H = \alpha^0 I + \alpha^k\sigma^k, \tag{B.6}$$

where the index k runs from 1 to 3, and 

$$\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

The $\sigma^k$ are the same as the Pauli spin matrices, and hence they satisfy 

$$(\sigma^1)^2 = (\sigma^2)^2 = (\sigma^3)^2 = I; \qquad \sigma^j\sigma^k + \sigma^k\sigma^j = 0, \; j \neq k;$$
$$[\sigma^1, \sigma^2] = \sigma^1\sigma^2 - \sigma^2\sigma^1 = 2i\sigma^3, \text{ etc.} \tag{B.7}$$

Since the unit matrix I commutes with all matrices, a general member of U(2) can be 
written as 

$$U = \exp i(\alpha^0 I + \alpha^k\sigma^k) = \exp(i\alpha^0)\exp(i\alpha^k\sigma^k).$$

The phase factor $\exp(i\alpha^0)$ belongs to the group U(1). Hence elements of SU(2) are of the 
form 

$$U_s = \exp(i\alpha^k\sigma^k). \tag{B.8}$$

An element may be specified by the three parameters $\alpha^k$; the matrices $\sigma^k$ are the 
corresponding generators of the group. Each has zero trace (see Problem B.1). 

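The algebra of the $\sigma^k$ matrices enables us to write these elements in closed form. Let us 
formally consider the $\alpha^k$ to make up a vector $\boldsymbol{\alpha} = \alpha\hat{\boldsymbol{\alpha}}$, where $\hat{\boldsymbol{\alpha}}$ is the corresponding unit 
vector, and write the ‘scalar product’ $\alpha^k\sigma^k$ as $\alpha\,\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{\sigma}$. It is easy to see that 

$$(\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{\sigma})^2 = \hat\alpha^j\sigma^j\hat\alpha^k\sigma^k = \hat\alpha^j\hat\alpha^j I = I,$$

since $\sigma^j\sigma^k + \sigma^k\sigma^j = 0$ for $j \neq k$ and $(\sigma^1)^2 = I$, etc. Then the power series expansion of (B.8) 
gives 

$$U_s = I + i\alpha(\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{\sigma}) + \frac{(i\alpha)^2}{2!}I + \cdots = \cos\alpha\, I + i\sin\alpha\,(\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{\sigma}). \tag{B.9}$$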


To establish the connection between the groups SU(2) and SO(3), we associate with 
each point x the Hermitian matrix 

$$X(x) = \begin{pmatrix} x^3 & x^1 - ix^2 \\ x^1 + ix^2 & -x^3 \end{pmatrix}. \tag{B.10}$$

This matrix has $\mathrm{Tr}\,X = 0$ and $\det X = -x^kx^k$. 

Consider now an element U of SU(2) and the matrix 

$$X' = UXU^\dagger. \tag{B.11}$$

(We are now dropping the suffix s on U.) 

X' is also Hermitian, and $\mathrm{Tr}\,X' = \mathrm{Tr}(UXU^\dagger) = \mathrm{Tr}(U^\dagger UX) = \mathrm{Tr}\,X = 0$. Hence X' is of 
the form 

$$X' = \begin{pmatrix} x'^3 & x'^1 - ix'^2 \\ x'^1 + ix'^2 & -x'^3 \end{pmatrix},$$

where the $x'^k$ are related to the $x^k$ by a real linear transformation. 

Also $\det X' = \det U \det X \det U^\dagger = \det(UU^\dagger)\det X = \det X$, so that $x'^kx'^k = x^kx^k$. 
Since the length of x is preserved and the transformation may be continuously generated 
from the identity matrix (see Problem B.3), the transformation must correspond to a 
proper rotation of the coordinate axes and hence to a rotation matrix R(U). 

As an example, the SU(2) matrix 

$$U = \exp[i(\theta/2)\sigma^3] = \cos(\theta/2)I + i\sin(\theta/2)\sigma^3 = \begin{pmatrix} e^{i\theta/2} & 0 \\ 0 & e^{-i\theta/2} \end{pmatrix}, \tag{B.12}$$

where we have used (B.9), corresponds to the rotation matrix $R_{03}(\theta)$ of equation (B.2). 
This may be verified by direct matrix multiplication. 
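The direct multiplication can also be carried out numerically. The sketch below, with helper names chosen for this illustration only, builds X from a point x as in (B.10), applies $X' = UXU^\dagger$ with the diagonal U of (B.12), reads off x', and compares with the rotation about the 03 axis written out in Section B.2.

```python
import cmath
import math

def matmul(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(A):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[A[j][i].conjugate() for j in range(2)] for i in range(2)]

def rotate_via_su2(theta, x):
    """Map x -> x' through X' = U X U^dagger with U = exp[i(theta/2) sigma^3]."""
    x1, x2, x3 = x
    X = [[x3, x1 - 1j * x2], [x1 + 1j * x2, -x3]]
    U = [[cmath.exp(1j * theta / 2), 0], [0, cmath.exp(-1j * theta / 2)]]
    Xp = matmul(matmul(U, X), dagger(U))
    # Lower-left entry is x'^1 + i x'^2; upper-left entry is x'^3.
    return (Xp[1][0].real, Xp[1][0].imag, Xp[0][0].real)

def rotate_via_so3(theta, x):
    """Rotation of axes about the 03 axis, as in the explicit equations of B.2."""
    c, s = math.cos(theta), math.sin(theta)
    x1, x2, x3 = x
    return (c * x1 + s * x2, -s * x1 + c * x2, x3)

theta, point = 0.7, (1.0, 2.0, 3.0)
from_su2 = rotate_via_su2(theta, point)
from_so3 = rotate_via_so3(theta, point)
# theta + 2*pi changes U into -U but must give the same rotated point.
from_minus_u = rotate_via_su2(theta + 2 * math.pi, point)
print(from_su2, from_so3, from_minus_u)
```

The third call illustrates the two-to-one correspondence: the angles $\theta$ and $\theta + 2\pi$ give matrices U and −U, yet the same x'.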

The matrices U and −U give the same transformation (B.11), and hence correspond to 
the same rotation matrix: to every element of SO(3) there correspond two elements of 
SU(2), differing by a factor of −1. In the example (B.12) above, rotations of $\theta$ and $\theta + 2\pi$ 
about the 03 axis correspond to the same rotation matrix, but give matrices U and −U, 
respectively, in SU(2). 
B.4 The group SL(2,C) and the proper Lorentz group 

The set of all 2 x 2 matrices with complex elements and with determinant equal to 1 
evidently forms a group under matrix multiplication. This group is denoted by SL( 2,C). It 
is related to the group of proper Lorentz transformations in much the same way as the 
group SU( 2) is related to the group of proper rotations. 

We now associate with each point $x = (x^0, \mathbf{x})$ in space-time the general Hermitian 
matrix 

$$X(x) = \begin{pmatrix} x^0 + x^3 & x^1 - ix^2 \\ x^1 + ix^2 & x^0 - x^3 \end{pmatrix}, \tag{B.13}$$

which has 

$$\det X = (x^0)^2 - x^kx^k.$$

Consider an element M of SL(2,C) and the matrix X' given by 

$$M^\dagger X'M = X \quad\text{or}\quad X' = (M^{-1})^\dagger XM^{-1}. \tag{B.14}$$

Then X' is also Hermitian and hence we can write 

$$X' = \begin{pmatrix} x'^0 + x'^3 & x'^1 - ix'^2 \\ x'^1 + ix'^2 & x'^0 - x'^3 \end{pmatrix},$$

where the $x'^\mu$ are related to the $x^\mu$ ($\mu$ = 0, 1, 2, 3) by a real linear transformation. Also 

$$\det(M^\dagger X'M) = \det M^\dagger \det X' \det M = \det X' = \det X,$$

so that 

$$(x'^0)^2 - x'^kx'^k = (x^0)^2 - x^kx^k.$$

Hence the matrix M corresponds to a Lorentz transformation matrix L(M). The matrices 
L(M) form a group that includes the identity transformation L(I) = I, and hence by 
continuity correspond to proper Lorentz transformations. 

A general proper Lorentz transformation between frames K and K' is specified by six 
parameters: three parameters to give the velocity v of K' relative to K and three parameters 
to give the orientation of K' relative to K. A general 2 × 2 complex matrix is defined by 
eight real parameters. The condition det M = 1 reduces this number to six. Hence a matrix 
M can be found corresponding to every proper Lorentz transformation. The matrices M 
and −M give the same transformation (B.14): two elements of SL(2,C) correspond to each 
element of the proper Lorentz group. 

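The matrix 

$$P = \exp[(\theta/2)\sigma^3] = \cosh(\theta/2)I + \sinh(\theta/2)\sigma^3 = \begin{pmatrix} e^{\theta/2} & 0 \\ 0 & e^{-\theta/2} \end{pmatrix} \tag{B.15}$$

corresponds to the Lorentz boost (2.3) of Chapter 2, as may be verified by direct matrix 
multiplication. 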
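As a numerical illustration of this correspondence, the sketch below applies (B.14) with M = P to a sample space-time point and compares the result with a boost along the 3-axis of rapidity $\theta$. The explicit form $x'^0 = x^0\cosh\theta - x^3\sinh\theta$, $x'^3 = x^3\cosh\theta - x^0\sinh\theta$ used for the comparison is an assumption about the sign convention of equation (2.3), which is not reproduced in this appendix; the function names are likewise illustrative.

```python
import math

def matmul(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def boost_via_sl2c(theta, x):
    """Apply X' = (P^-1)^dagger X P^-1 with P = exp[(theta/2) sigma^3]."""
    x0, x1, x2, x3 = x
    X = [[x0 + x3, x1 - 1j * x2], [x1 + 1j * x2, x0 - x3]]
    # P is real and diagonal, so (P^-1)^dagger = P^-1.
    Pinv = [[math.exp(-theta / 2), 0], [0, math.exp(theta / 2)]]
    Xp = matmul(matmul(Pinv, X), Pinv)
    x0p = (Xp[0][0] + Xp[1][1]).real / 2
    x3p = (Xp[0][0] - Xp[1][1]).real / 2
    return (x0p, Xp[1][0].real, Xp[1][0].imag, x3p)

def interval(x):
    """Invariant (x^0)^2 - x^k x^k."""
    return x[0] ** 2 - x[1] ** 2 - x[2] ** 2 - x[3] ** 2

theta, event = 0.5, (2.0, 0.3, -0.4, 1.0)
boosted = boost_via_sl2c(theta, event)
expected_x0 = event[0] * math.cosh(theta) - event[3] * math.sinh(theta)
expected_x3 = event[3] * math.cosh(theta) - event[0] * math.sinh(theta)
print(boosted, interval(event), interval(boosted))
```

The transverse components are unchanged and the interval is preserved, as required of a Lorentz transformation.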

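More generally, a Lorentz boost from a frame K to a frame K' moving with velocity 
$v = \tanh\theta$ in the direction of the unit vector $\hat{\mathbf{v}}$ is given by 

$$P = \exp[(\theta/2)\hat{\mathbf{v}}\cdot\boldsymbol{\sigma}] = \cosh(\theta/2)I + \sinh(\theta/2)\hat{\mathbf{v}}\cdot\boldsymbol{\sigma},$$

where $\boldsymbol{\sigma} = (\sigma^1, \sigma^2, \sigma^3)$. 

Note that, since the matrices $\sigma^k$ are Hermitian, so also is any matrix P corresponding to 
a Lorentz boost. 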
B.5 Transformations of the Pauli matrices 


In discussing Lorentz transformations, it is convenient to write $I = \sigma^0$ and introduce the 
notation 

$$\sigma^\mu = (\sigma^0, \sigma^1, \sigma^2, \sigma^3), \qquad \tilde\sigma^\mu = (\sigma^0, -\sigma^1, -\sigma^2, -\sigma^3).$$

Then from (B.13) 

$$X(x) = x^0\sigma^0 + x^k\sigma^k = x_\mu\tilde\sigma^\mu, \qquad X'(x') = x'_\mu\tilde\sigma^\mu.$$

The relation 

$$M^\dagger X'M = X \tag{B.16}$$

gives 

$$x'_\mu M^\dagger\tilde\sigma^\mu M = x_\nu\tilde\sigma^\nu = L^\mu{}_\nu\tilde\sigma^\nu x'_\mu$$

(see Problem 2.2). Since the $x'_\mu$ are arbitrary, we can deduce 

$$M^\dagger\tilde\sigma^\mu M = L^\mu{}_\nu\tilde\sigma^\nu. \tag{B.17}$$

Also (Problem B.6) 

$$L^{\mu\nu} = \tfrac{1}{2}\mathrm{Tr}(\sigma^\nu M^\dagger\tilde\sigma^\mu M).$$

Similarly, by considering the matrix 

$$X_1(x) = x^0\sigma^0 - x^k\sigma^k = x_\nu\sigma^\nu,$$

which also has $\det X_1 = (x^0)^2 - x^kx^k$, we can show that there exists a matrix N belonging 
to SL(2,C) such that 

$$N^\dagger\sigma^\mu N = L^\mu{}_\nu\sigma^\nu. \tag{B.18}$$

The matrices M and N are evidently related. The reader may verify directly that when 
M = P, where P is given by (B.15) and corresponds to a Lorentz boost, we can take 
$N = P^{-1}$, and this will be true for a Lorentz boost in any direction. For a pure rotation of 
axes, we take M = N = U, where U is a unitary matrix. A general M can be constructed 
as a product of a rotation followed by a boost: M = PU. The corresponding N is given by 
$N = P^{-1}U$. 

Now U satisfies $UU^\dagger = I$, and we noted that P is Hermitian, $P = P^\dagger$. Hence 

$$NM^\dagger = (P^{-1}U)(U^\dagger P) = I, \tag{B.19}$$

so that N is the inverse of $M^\dagger$. 

The results (B.17) and (B.18), together with (B.19), are useful in constructing Lorentz 
scalars, vectors and higher order tensors. 


B.6 Spinors 


We define a left-handed spinor 

$$l = \begin{pmatrix} l_1 \\ l_2 \end{pmatrix}$$

as a complex two-component entity that transforms under a Lorentz transformation with 
matrix L(M) by the rule 

$$l' = Ml, \tag{B.20}$$

i.e. $l'_a = M_{ab}l_b$, where a and b take on the values 1, 2. 

We similarly define a right-handed spinor 

$$r = \begin{pmatrix} r_1 \\ r_2 \end{pmatrix}$$

as a two-component entity that transforms by 

$$r' = Nr. \tag{B.21}$$

Electrons, and all other fermions in the Standard Model, are described by spinor fields. 
The nomenclature of ‘left-handed’ and ‘right-handed’ is elucidated in Section 6.3. 

Spinors have the remarkable property that they can be combined in pairs to make 
Lorentz scalars, Lorentz four-vectors and higher order Lorentz tensors. For example, 

$l^\dagger r = l_a^* r_a$ is a (complex) Lorentz scalar, since 

$$l'^\dagger r' = (Ml)^\dagger Nr = l^\dagger M^\dagger Nr = l^\dagger r, \tag{B.22}$$

where we have used (B.19). 

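The quantities 

$$l^\dagger\tilde\sigma^\mu l = l^\dagger(\sigma^0, -\sigma^1, -\sigma^2, -\sigma^3)l,$$
$$r^\dagger\sigma^\mu r = r^\dagger(\sigma^0, \sigma^1, \sigma^2, \sigma^3)r,$$

transform like (real) contravariant four-vectors, since 

$$l'^\dagger\tilde\sigma^\mu l' = l^\dagger M^\dagger\tilde\sigma^\mu Ml = L^\mu{}_\nu(l^\dagger\tilde\sigma^\nu l), \tag{B.23}$$

using (B.17), and 

$$r'^\dagger\sigma^\mu r' = r^\dagger N^\dagger\sigma^\mu Nr = L^\mu{}_\nu(r^\dagger\sigma^\nu r), \tag{B.24}$$

using (B.18). 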


B.7 The group SU(3) 

The special unitary group SU( 3) is the group of all 3 x 3 unitary matrices with 
determinant equal to 1. Our discussion will parallel our discussion of the group SU( 2) in 
Section B.3. An element of SU(3) can be expressed as 

U = exp(iH) 


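where H is a 3 × 3 Hermitian matrix. A general 3 × 3 Hermitian matrix is specified by 
$3^2 = 9$ real parameters (Appendix A). The condition det U = 1, or equivalently Tr H = 0 
(Problem B.1), reduces this number to 8. In place of the $\sigma^k$ matrices used in Section B.3, 
we have the eight traceless Hermitian matrices introduced by Gell-Mann: 

$$\lambda_1 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad \lambda_2 = \begin{pmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad \lambda_3 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{pmatrix},$$

$$\lambda_4 = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}, \quad \lambda_5 = \begin{pmatrix} 0 & 0 & -i \\ 0 & 0 & 0 \\ i & 0 & 0 \end{pmatrix}, \quad \lambda_6 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix},$$

$$\lambda_7 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -i \\ 0 & i & 0 \end{pmatrix}, \quad \lambda_8 = \frac{1}{\sqrt{3}}\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -2 \end{pmatrix}. \tag{B.25}$$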
A general traceless Hermitian matrix is of the form 

$$H = \alpha_1\lambda_1 + \alpha_2\lambda_2 + \cdots + \alpha_8\lambda_8 = \begin{pmatrix} \alpha_3 + \alpha_8/\sqrt{3} & \alpha_1 - i\alpha_2 & \alpha_4 - i\alpha_5 \\ \alpha_1 + i\alpha_2 & -\alpha_3 + \alpha_8/\sqrt{3} & \alpha_6 - i\alpha_7 \\ \alpha_4 + i\alpha_5 & \alpha_6 + i\alpha_7 & -2\alpha_8/\sqrt{3} \end{pmatrix}. \tag{B.26}$$

The matrices $\lambda_a$ satisfy the commutation relations 

$$[\lambda_a, \lambda_b] = 2i\sum_{c=1}^{8} f_{abc}\lambda_c, \tag{B.27}$$

where the $f_{abc}$ are the structure constants (cf. equations (B.7)). The $f_{abc}$ are odd in the 
interchange of any pair of indices, and the non-vanishing $f_{abc}$ are given by the 
permutations of $f_{123} = 1$, $f_{147} = f_{246} = f_{257} = f_{345} = f_{516} = f_{637} = 1/2$, $f_{458} = f_{678} = \sqrt{3}/2$. 

The matrices also have the property 

$$\mathrm{Tr}(\lambda_a\lambda_b) = 2\delta_{ab}, \tag{B.28}$$

where $\delta_{ab}$ is the Kronecker $\delta$. 

These results may be verified by direct calculation. 
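The direct calculation is easy to automate. The sketch below assumes the standard Gell-Mann matrices, the normalisation $\mathrm{Tr}(\lambda_a\lambda_b) = 2\delta_{ab}$, and commutators of the form $[\lambda_a, \lambda_b] = 2i f_{abc}\lambda_c$, so that a structure constant can be extracted as $f_{abc} = \mathrm{Tr}([\lambda_a, \lambda_b]\lambda_c)/4i$. The names `LAM`, `trace_product` and `f` are illustrative.

```python
import math

R3 = 1 / math.sqrt(3)
LAM = {
    1: [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    2: [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    3: [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    4: [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    5: [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    6: [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    7: [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    8: [[R3, 0, 0], [0, R3, 0], [0, 0, -2 * R3]],
}

def matmul(A, B):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def trace(A):
    return sum(A[i][i] for i in range(3))

def trace_product(a, b):
    """Tr(lambda_a lambda_b)."""
    return trace(matmul(LAM[a], LAM[b]))

def f(a, b, c):
    """Structure constant f_abc = Tr([lambda_a, lambda_b] lambda_c) / 4i."""
    ab, ba = matmul(LAM[a], LAM[b]), matmul(LAM[b], LAM[a])
    comm = [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]
    return (trace(matmul(comm, LAM[c])) / 4j).real

# Check the trace normalisation for all 64 pairs.
orthonormal = all(abs(trace_product(a, b) - (2 if a == b else 0)) < 1e-12
                  for a in range(1, 9) for b in range(1, 9))
print(orthonormal, f(1, 2, 3), f(1, 4, 7), f(5, 1, 6), f(4, 5, 8))
```

Looping `f` over all index triples reproduces the full table of non-vanishing structure constants, including the antisymmetry under interchange of any pair of indices.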


Problems 


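B.1 Show that if U = exp(iH) and Tr H = 0, then det U = 1. (Make H diagonal with a 
unitary transformation. U is then also diagonal.) 

B.2 Verify that the SU(2) matrices $\exp[i(\theta/2)\sigma^1]$ and $\exp[i(\theta/2)\sigma^2]$ correspond to 
rotations $R_{01}(\theta)$ and $R_{02}(\theta)$, respectively. 

B.3 Show that the SU(2) matrix corresponding to the rotation $R(\psi, \theta, \phi)$ (equation (B.5)) 
is 

$$\begin{pmatrix} e^{i\psi/2}\cos(\theta/2)e^{i\phi/2} & e^{i\psi/2}\sin(\theta/2)e^{-i\phi/2} \\ -e^{-i\psi/2}\sin(\theta/2)e^{i\phi/2} & e^{-i\psi/2}\cos(\theta/2)e^{-i\phi/2} \end{pmatrix}.$$

B.4 Show that $l^\dagger\tilde\sigma^\mu\sigma^\nu r$ transforms as a tensor and $l^\dagger(\tilde\sigma^\mu\sigma^\nu + \tilde\sigma^\nu\sigma^\mu)r = 2g^{\mu\nu}l^\dagger r$. 

B.5 Show that the rotation matrix $R^i{}_j$ of equation (B.1) is related to the SU(2) matrix U 
of (B.11) by 

$$R^i{}_j = \tfrac{1}{2}\mathrm{Tr}(\sigma^i U\sigma^j U^\dagger).$$

B.6 Show from (B.17) that 

$$L^{\mu\nu} = \tfrac{1}{2}\mathrm{Tr}(\sigma^\nu M^\dagger\tilde\sigma^\mu M).$$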
Appendix C 

Annihilation and creation operators 


C.1 The simple harmonic oscillator 

The reader may well have met annihilation and creation operators in treating the quantum 
mechanics of the simple harmonic oscillator. In this context, an operator a and its 
Hermitian conjugate $a^\dagger$ are constructed. These satisfy the commutation relations 

$$[a, a^\dagger] = aa^\dagger - a^\dagger a = 1 \tag{C.1}$$

and also of course 

$$[a, a] = 0, \qquad [a^\dagger, a^\dagger] = 0.$$

The operator $N = a^\dagger a$ is Hermitian. We denote by $|n\rangle$ the normalised eigenstate of N 
with eigenvalue n. Since $n = \langle n|a^\dagger a|n\rangle$ is the modulus squared of the state $a|n\rangle$, n is real 
and ≥ 0, and equal to 0 only if $a|n\rangle = 0$. 

It follows from the commutation relations that the lowest eigenvalue of N is n = 0, 
corresponding to the ground state $|0\rangle$. This is because 

$$Na|n\rangle = a^\dagger aa|n\rangle = (aa^\dagger - 1)a|n\rangle = (n - 1)a|n\rangle.$$

Thus $a|n\rangle$ is, apart from normalisation, an eigenstate of N with eigenvalue (n − 1), unless 
$a|n\rangle = 0$. Similarly $a|n - 1\rangle$ is an eigenstate of N with eigenvalue (n − 2), and so on. The 
process must terminate at the eigenstate $|0\rangle$ with eigenvalue 0, and $a|0\rangle = 0$, since 
otherwise we would be able to violate the condition n ≥ 0. 

Similarly $a^\dagger|n\rangle$ is, apart from normalisation, an eigenstate of N with eigenvalue (n + 1). 

Thus the eigenvalues of the number operator N are the integers 0, 1, 2, 3, . . . 

Since $\langle n|a^\dagger a|n\rangle = n$, we have 

$$a|n\rangle = n^{1/2}|n - 1\rangle. \tag{C.2}$$

Also, $\langle n|aa^\dagger|n\rangle = \langle n|a^\dagger a + 1|n\rangle = n + 1$, so that 

$$a^\dagger|n\rangle = (n + 1)^{1/2}|n + 1\rangle. \tag{C.3}$$

We call a an annihilation operator and $a^\dagger$ a creation operator. 
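The relations (C.1) to (C.3) can be made concrete with matrices. The sketch below represents a in the basis $|0\rangle, \ldots, |D-1\rangle$, with the cut-off D chosen arbitrarily for this illustration. The truncation is an assumption of the example, not part of the theory: it spoils the commutator in the last diagonal entry only.

```python
import math

D = 6  # illustrative cut-off: basis states |0>, ..., |D-1>

# Matrix elements <m|a|n> = sqrt(n) for m = n - 1, as in (C.2).
a = [[math.sqrt(n) if m == n - 1 else 0.0 for n in range(D)] for m in range(D)]
# a is real, so its Hermitian conjugate is just the transpose.
adag = [[a[n][m] for n in range(D)] for m in range(D)]

def matmul(A, B):
    """Product of two D x D matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(D)) for j in range(D)]
            for i in range(D)]

N = matmul(adag, a)      # number operator a^dagger a
aad = matmul(a, adag)    # a a^dagger

number_eigenvalues = [N[n][n] for n in range(D)]
commutator_diag = [aad[n][n] - N[n][n] for n in range(D)]
print(number_eigenvalues)
print(commutator_diag)
```

The diagonal of N lists the eigenvalues 0, 1, 2, . . ., and the commutator is 1 on every state below the cut-off. The final entry, −(D − 1), is the truncation artefact.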

Written in terms of a and $a^\dagger$, the simple harmonic oscillator Hamiltonian becomes 

$$H = \left(a^\dagger a + \tfrac{1}{2}\right)\hbar\omega = \left(N + \tfrac{1}{2}\right)\hbar\omega, \tag{C.4}$$

where $\omega$ is the frequency of the corresponding classical oscillator (Problem C.1). The term 
$\tfrac{1}{2}\hbar\omega$ is the zero-point energy. Since in field theory only energy differences are of physical 
significance, it is usually convenient to redefine H, dropping the zero-point energy and 
taking $H = a^\dagger a\,\hbar\omega$. We may then reinterpret the state $|n\rangle$ as a state in which there are n 
identical ‘particles’ each of energy $\hbar\omega$, associated with the oscillator, and say that a and $a^\dagger$ 
annihilate and create particles. 

In the Heisenberg representation (Section 8.2), 

$$a(t) = e^{iHt}ae^{-iHt} = e^{iN\omega t}ae^{-iN\omega t} = e^{-i\omega t}a. \tag{C.5}$$

This may be seen by considering the effect of a(t) acting on a state $|n\rangle$, and noting that, 
since 

$$e^{\pm iN\omega t}|n\rangle = e^{\pm in\omega t}|n\rangle,$$

the two expressions for a(t) give the same result. Similarly, 

$$a^\dagger(t) = e^{i\omega t}a^\dagger. \tag{C.6}$$

C.2 An assembly of bosons 

A similar operator formalism may be developed for assemblies of identical particles. We 
set out first the formalism when the particles are bosons. 

Let $u_i(\xi)$ be a complete set of single particle states, where $\xi$ stands for the space and 
spin coordinate of a particle. We define annihilation and creation operators $a_i$ and $a_i^\dagger$ for 
each state, satisfying the commutation relations 

$$[a_i, a_j^\dagger] = \delta_{ij}, \qquad [a_i, a_j] = 0, \qquad [a_i^\dagger, a_j^\dagger] = 0. \tag{C.7}$$

Any state of the system can be constructed by operating on the vacuum state $|0\rangle$, in 
which there are no particles present, and $a_i|0\rangle = 0$ for all i. For example, a three-particle 
state having two particles in the state $u_1$ and one particle in the state $u_2$ is given (apart 
from normalisation) by $a_1^\dagger a_1^\dagger a_2^\dagger|0\rangle$. Evidently such a state is symmetric in the interchange 
of any two particles since the creation operators all commute, and the particles will obey 
Bose-Einstein statistics. 

It follows from the commutation relations that the number operator $N_i = a_i^\dagger a_i$ gives the 
number of particles in the state $u_i$. In the case of non-interacting bosons, the $u_i(\xi)$ can be 
taken as the single particle energy eigenstates and the Hamiltonian operator is then 

$$H_0 = \sum_i a_i^\dagger a_i\varepsilon_i = \sum_i N_i\varepsilon_i, \tag{C.7}$$

where the $\varepsilon_i$ are the single particle energy levels. 

In the Heisenberg representation and with the free particle Hamiltonian $H_0$, the time 
dependence of the annihilation and creation operators is like that of simple harmonic 
oscillator operators, and follows by a similar argument: 

$$a_i(t) = e^{-i\varepsilon_it}a_i, \qquad a_i^\dagger(t) = e^{i\varepsilon_it}a_i^\dagger. \tag{C.8}$$

C.3 An assembly of fermions 

In the case of an assembly of identical fermions, we define annihilation and creation 
operators $b_i$ and $b_i^\dagger$ for each single particle state $u_i(\xi)$, which are anticommuting: 

$$\{b_i, b_j^\dagger\} = b_ib_j^\dagger + b_j^\dagger b_i = \delta_{ij}, \qquad \{b_i, b_j\} = 0, \qquad \{b_i^\dagger, b_j^\dagger\} = 0. \tag{C.9}$$

In particular, 

$$(b_i)^2 = 0, \qquad (b_i^\dagger)^2 = 0. \tag{C.10}$$

Thus two fermions cannot be annihilated from the same state, or created in the same state, 
in accord with the Pauli principle. 

The number operator $N_i = b_i^\dagger b_i$ satisfies 

$$N_i^2 = b_i^\dagger b_ib_i^\dagger b_i = b_i^\dagger(1 - b_i^\dagger b_i)b_i = b_i^\dagger b_i = N_i,$$

or 

$$N_i(N_i - 1) = 0,$$

so that the eigenvalues of $N_i$ are 0 and 1. This, again, is in accord with the Pauli principle. 
A many-particle fermion state can be constructed by operating on the vacuum state $|0\rangle$ 
with creation operators. For example $b_1^\dagger b_2^\dagger b_3^\dagger|0\rangle$ is a state with a fermion in each of the 
states $u_1$, $u_2$, $u_3$. Such a state is antisymmetric under particle exchange, and the particles 
obey Fermi-Dirac statistics. 

In the case of an assembly of non-interacting fermions, the Hamiltonian operator is 

$$H_0 = \sum_i b_i^\dagger b_i\varepsilon_i, \tag{C.11}$$

and in the Heisenberg representation 

$$b_i(t) = e^{-i\varepsilon_it}b_i, \qquad b_i^\dagger(t) = e^{i\varepsilon_it}b_i^\dagger. \tag{C.12}$$

Problems 

C.1 With rescaling of coordinates, 

$$P = p/(m\hbar\omega)^{1/2}, \qquad X = x(m\omega/\hbar)^{1/2},$$

the simple harmonic oscillator Hamiltonian 

$$H = (p^2/2m) + (m\omega^2x^2/2)$$

becomes 

$$H = (\hbar\omega/2)(P^2 + X^2),$$

and 

$$[X, P] = i.$$

Show that if $a = (1/\sqrt{2})(X + iP)$, $a^\dagger = (1/\sqrt{2})(X - iP)$, then 

$$[a, a^\dagger] = 1 \quad\text{and}\quad H = \left(a^\dagger a + \tfrac{1}{2}\right)\hbar\omega.$$

C.2 Show that the normalised ground state wave function of the simple harmonic oscillator 
is $(m\omega/\pi\hbar)^{1/4}\exp(-m\omega x^2/2\hbar)$. 

C.3 Using the anticommutation relations for fermions show that the state $b_i^\dagger|0\rangle$ is an eigenstate 
of $N_i = b_i^\dagger b_i$ with eigenvalue 1. 

C.4 Show that the matrices 

$$b = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix} \quad\text{and}\quad b^\dagger = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$$

satisfy the anticommutation relations for fermion annihilation and creation operators. 


Appendix D 

The parton model 


D.1 Elastic electron scattering from nucleons 

In the 1950s, experiments on elastic scattering of electrons from nucleon targets at rest in 
the laboratory revealed the electric charge distribution in protons and neutrons, clearly 
establishing the size of the nucleons. 

The differential cross-section for the elastic scattering of electrons at high energies 
from a Dirac particle of mass M and charge e may be calculated in QED. To leading order 
in the fine-structure constant $\alpha = e^2/4\pi$, and neglecting the electron’s mass compared 
with its energy, the differential cross-section for scattering from an unpolarised Dirac 
particle, initially at rest in the laboratory frame, in which the scattered electron emerges at 
an angle $\theta$ with respect to its incident direction, is 

$$\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4E^2\sin^4(\theta/2)}\,\frac{E'}{E}\left[\cos^2(\theta/2) + \frac{Q^2}{2M^2}\sin^2(\theta/2)\right], \tag{D.1}$$

where 

(E, p) = initial electron energy-momentum four-vector, 

(E', p') = final electron energy-momentum four-vector, 

$q^\mu$ = (E − E', p − p') = energy-momentum transfer, 

$Q^2 = -q_\mu q^\mu = (\mathbf{p} - \mathbf{p}')^2 - (E - E')^2$. 

(See, for example, Gross, 1993, p. 294.) 

Note that $Q^2$ is Lorentz invariant. For elastic scattering at a given energy, the angle $\theta$ 
determines, through energy and momentum conservation, all other quantities in the 
expression. For example, 

$$Q^2 = 4EE'\sin^2(\theta/2), \tag{D.2}$$

where the energy E' is given by 

$$M(E - E') - 2EE'\sin^2(\theta/2) = 0 \tag{D.3}$$

(Problem D.1). 
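To make the statement that $\theta$ fixes everything else concrete, the sketch below solves (D.3) for E' and then evaluates (D.2). The function name and the numerical inputs (a 0.4 GeV beam, a 60° scattering angle, and a proton mass of about 0.938 GeV) are illustrative choices, not values taken from the text.

```python
import math

def elastic_kinematics(E, theta, M):
    """Return (E', Q^2) for elastic scattering through angle theta.

    E' follows from (D.3): M(E - E') = 2 E E' sin^2(theta/2);
    Q^2 then follows from (D.2): Q^2 = 4 E E' sin^2(theta/2).
    Energies and M are in the same units (GeV here), electron mass neglected.
    """
    s2 = math.sin(theta / 2) ** 2
    E_prime = E / (1 + (2 * E / M) * s2)
    Q2 = 4 * E * E_prime * s2
    return E_prime, Q2

E, theta, M = 0.4, math.radians(60), 0.938   # illustrative values
E_prime, Q2 = elastic_kinematics(E, theta, M)
print(E_prime, Q2, 2 * M * (E - E_prime))
```

The last two printed numbers agree: combining (D.2) and (D.3) gives $Q^2 = 2M(E - E')$ for elastic scattering.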

Taking M to be the proton mass, the formula (D.1) does not fit the experimental data 
and, indeed, since the proton has an anomalous magnetic moment ≈ 1.79($e\hbar/2M$), we 
would not expect a fit. More generally, the elastic scattering from an unpolarised 
‘extended’ proton is of the form 

$$\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4E^2\sin^4(\theta/2)}\,\frac{E'}{E}\left[\left\{f_1^2(Q^2) + \frac{Q^2}{4M^2}f_2^2(Q^2)\right\}\cos^2(\theta/2) + \frac{Q^2}{2M^2}\left\{f_1(Q^2) + f_2(Q^2)\right\}^2\sin^2(\theta/2)\right]. \tag{D.4}$$

The form of this expression is essentially determined given that the proton has spin 1/2 and 
no electric dipole moment. $f_1(Q^2)$ is called the Dirac form factor of the proton, and 
$f_2(Q^2)$ is the form factor associated with the anomalous magnetic moment. At 
$Q^2 = 0$, $f_1(0) = 1$ and $f_2(0) \approx 1.79$ (corresponding to the anomalous moment). The 
electric and magnetic form factors 

$$G_E(Q^2) = f_1(Q^2) - \frac{Q^2}{4M^2}f_2(Q^2), \tag{D.5}$$

$$G_M(Q^2) = f_1(Q^2) + f_2(Q^2), \tag{D.6}$$

can be interpreted in the non-relativistic limit as Fourier transforms of the electric charge 
and magnetic moment distributions in the proton (Problem D.2). It is from their 
experimental determination that the size of the proton is inferred. Both $f_1(Q^2)$ and $f_2(Q^2)$ 
fall off rapidly as $Q^2$ increases (Fig. D.1). Similar form factors can be defined, and 
determined experimentally, for the neutron (using scattering data from deuterium targets). 
The analysis is consistent with the quark model. Since the electric charge is carried by the 
quarks, the charge and magnetic moment distribution should trace the distributions of 
quark charge and quark magnetic moment. 


D.2 Inelastic electron scattering from nucleons: the parton model 

The early elastic scattering experiments were performed at electron energies < 500 MeV. 
Scattering at higher energies has thrown more light on the behaviour of quarks in 
nucleons, and revealed properties that will continue to be crucial for pursuing particle 
physics at the even higher energies of the future. Except where Q 2 is small, inelastic 
scattering, which involves hadron production, becomes the dominant mode at higher 
energies. In the case of inelastic scattering, $\theta$ and E' are independent variables. In general, 
there are many other independent variables that describe the final hadronic system, but the 
very important differential cross-section $d^2\sigma/dE'\,d\Omega$, called the inclusive cross-section, 
includes all the possible final hadronic states. 

At the electron-proton collider HERA at Hamburg a beam of 30 GeV electrons meets a 
beam of 820 GeV protons head on. Many features of the ensuing electron-proton 
collisions are described by the parton model, which was introduced by Feynman in 1969. 

In the parton model each proton in the beam is regarded as a system of sub-particles, 
called partons. These are quarks, antiquarks and gluons. Quarks and antiquarks are the 
partons that carry electric charge. The proton’s energy and momentum $P^\mu$ is envisaged as 
being distributed over the different parton types i with certain probability distributions. 
The mean number of partons of type i in the proton carrying energy and momentum in the 
range $xP^\mu$, $(x + dx)P^\mu$, 0 < x < 1, is written $p_i(x)\,dx$. Here the label i covers all types 
of quarks, antiquarks and gluons (u, ū, d, d̄, s, s̄, etc.). Scaling both energy and momentum 
by the same factor ensures that all the partons have the velocity of the proton. Any 
transverse momentum a parton may have is neglected. Thus, in the model, each proton in 
the HERA beam is regarded as a sub-beam of partons. 

Figure D.1 This figure shows the measured magnetic dipole form factor of the 
proton. The data are quite well represented by the simple expression 

$$G_M(Q^2) = \mu_p\left[\frac{1}{1 + Q^2/\beta^2}\right]^2$$

with $\mu_p = 2.79$, $\beta = 0.84$ GeV. This curve is shown. 
For $Q^2 < 3\ \mathrm{GeV}^2$, $G_E(Q^2) = G_M(Q^2)/\mu_p$, but for $Q^2 > 5\ \mathrm{GeV}^2$ only $G_M(Q^2)$ 
can be measured with accuracy (see Coward et al., 1968). 

The consequences of the model for 
the inclusive cross-section can be most easily demonstrated in the rest frame of the proton. 
In this frame, a parton with energy-momentum fraction x will behave like a particle of 
mass xM at rest. For Q 2 < M 2 the dominant scattering will be electromagnetic scattering 
from the charged partons: the spin 1/2 quarks and antiquarks. For the elastic scattering 
from a parton of type i with effective mass xM we have 


$$\frac{d^2\sigma^i}{dE'\,d\Omega} = \frac{(xM)E}{E'}\,\delta\{(E' - E)(xM) + 2EE'\sin^2(\theta/2)\}\left(\frac{d\sigma^i}{d\Omega}\right)_{\rm elastic}, \tag{D.7}$$
where $(d\sigma^i/d\Omega)_{\rm elastic}$ is of the form given by (D.4), but with M replaced by (xM), and $\alpha^2$ 
by $q_i^2\alpha^2$ where $q_i^2 = (1/3)^2$ or $(2/3)^2$ depending on the type of parton. On integrating over 
E', the $\delta$-function in (D.7) picks out the energy for elastic scattering through an angle $\theta$, as 
required by the condition (D.3) with (xM) in place of M. 

(Note that $\delta(aE' - b) = (E'/b)\,\delta(E' - b/a)$, a > 0.) If we define 

$$\nu = E - E'$$

then 

$$\frac{d^2\sigma^i}{dE'\,d\Omega} = \frac{(xM)E}{E'}\,\delta\{(xM)\nu - Q^2/2\}\left(\frac{d\sigma^i}{d\Omega}\right)_{\rm elastic}. \tag{D.8}$$

Averaging over a large number of collisions, and assuming that the partons scatter 
incoherently, the inclusive cross-section in the parton model is 


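$$\frac{d^2\sigma}{dE'\,d\Omega} = \int \sum_i \frac{(xM)E}{E'}\,\delta\{(xM)\nu - Q^2/2\}\,p_i(x)\left(\frac{d\sigma^i}{d\Omega}\right)_{\text{elastic}} dx = \sum_i p_i(x)\,\frac{xE}{\nu E'}\left(\frac{d\sigma^i}{d\Omega}\right)_{\text{elastic}} \qquad \text{(D.9)}$$

where

$$x = Q^2/2M\nu, \qquad \text{(D.10)}$$

and the sum is over all types of charged partons. Finally, inserting explicitly the general
elastic scattering formula (D.4)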


$$\frac{d^2\sigma}{dE'\,d\Omega} = \frac{\alpha^2}{2ME^2\sin^4(\theta/2)}\left[\frac{M}{2\nu}F_2(x, Q^2)\cos^2(\theta/2) + F_1(x, Q^2)\sin^2(\theta/2)\right] \qquad \text{(D.11)}$$

where

$$F_2(x, Q^2) = x\sum_i p_i(x)\,q_i^2\left\{(f_1^i)^2 + \frac{\nu}{2Mx}(f_2^i)^2\right\}, \qquad \text{(D.12)}$$

$$F_1(x, Q^2) = \frac{1}{2}\sum_i p_i(x)\,q_i^2\left\{(f_1^i) + (f_2^i)\right\}^2 \qquad \text{(D.13)}$$

(using (D.10), $Q^2/4x^2M^2 = \nu/2Mx$).

In fact the form (D.11) for the inclusive cross-section, in terms of two structure
functions $F_1(x, Q^2)$ and $F_2(x, Q^2)$, is quite general, and does not depend on the model we
have introduced.

The wavelength $\hbar/Q$ is a measure of the scale on which the structure of the proton is
explored in an electron scattering experiment. For low Q, such that $\hbar/Q$ is large compared
with the size of the proton, we can anticipate that the electron is scattered coherently from
the proton as a whole. It is at high Q that the parton model becomes interesting. For $Q^2 >$ a
few $\mathrm{GeV}^2$, incoherent parton scattering seems to dominate, and the quarks and antiquarks
in the proton apparently behave almost like free elementary particles: their anomalous
moments can be neglected and we can set $f_2^i = 0$. Then from (D.12) and (D.13)

$$F_2(x, Q^2) = 2xF_1(x, Q^2). \qquad \text{(D.14)}$$

This, the Callan-Gross relation, is well satisfied experimentally.

If the charged partons are structureless Dirac particles, $f_1^i = 1$ for all $Q^2$, so that

$$F_2(x, Q^2) = x\sum_i p_i(x)\,q_i^2 = F_2(x), \qquad \text{(D.15)}$$

$$F_1(x, Q^2) = \frac{1}{2}\sum_i p_i(x)\,q_i^2 = F_1(x), \qquad \text{(D.16)}$$
Figure D.2 An illustration of a muon neutrino converting to a muon on scattering
from a d quark in a nucleon. The illustration indicates three ‘valence quarks’. In
fact there is additional scattering from quark-antiquark pairs that are generated by
the gluon field.


and both $F_2$ and $F_1$ depend only on the dimensionless parameter $x = Q^2/2M\nu$. This is
Bjorken scaling.
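
As a hedged numerical illustration of (D.15), (D.16) and the relation (D.14), the sketch below evaluates the naive parton-model structure functions for a toy set of charged partons. The function names and the toy distribution shapes are assumptions introduced only for this illustration; they are not fitted to any data.

```python
# Toy illustration of the naive parton model, equations (D.14)-(D.16).
# The distribution shapes are hypothetical, NOT fits to data.

def toy_partons(x):
    """Return a list of (charge q_i in units of e, p_i(x)) for charged partons."""
    shape = 5.0 * (1.0 - x) ** 4          # hypothetical shape, integrates to 1 on [0, 1]
    return [
        (+2.0 / 3.0, 2.0 * shape),        # two valence u quarks
        (-1.0 / 3.0, 1.0 * shape),        # one valence d quark
    ]

def F2(x, partons=toy_partons):
    """F_2(x) = x * sum_i q_i^2 p_i(x), as in (D.15)."""
    return x * sum(q * q * p for q, p in partons(x))

def F1(x, partons=toy_partons):
    """F_1(x) = (1/2) * sum_i q_i^2 p_i(x), as in (D.16)."""
    return 0.5 * sum(q * q * p for q, p in partons(x))

if __name__ == "__main__":
    for x in (0.1, 0.3, 0.5, 0.7):
        # The last two columns agree: F_2 = 2 x F_1, as in (D.14).
        print(x, F2(x), 2.0 * x * F1(x))
```

Because neither function depends on $Q^2$, the toy model scales exactly; any measured $Q^2$ dependence has to enter through the parton distributions themselves.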

$F_2(x, Q^2)$ is illustrated in Fig. 17.3 over a wide range of values of $Q^2$ and x. It can be
seen that the naive parton model is not strictly correct, but that the $Q^2$ dependence is weak
compared with that of the elastic form factor of the proton (Fig. D.1). It is usual to rewrite
(D.12) as

$$F_2(x, Q^2) = x\sum_i q_i^2\,p_i(x, Q^2), \qquad \text{(D.17)}$$

associating the $Q^2$ dependence with the parton distribution itself rather than with the
parton form factor. (See the discussion of the Altarelli-Parisi equations of QCD in Section
17.3.)

To determine the individual parton distributions $p_i(x, Q^2)$ introduced in equation
(D.17) requires more information than is contained in the proton structure functions alone.
The neutron has been investigated using deuteron targets, and, using the isospin symmetry
between the neutron and proton ($u \leftrightarrow d$, $\bar{u} \leftrightarrow \bar{d}$), the neutron data give further
independent information. The weak interaction between quarks and leptons is described in
Chapter 14. Neutrino and antineutrino inclusive cross-sections on proton and deuteron
targets (Fig. D.2) give a further four independent relationships, so that, neglecting the
contributions of heavier quarks, the individual $u, d, s, \bar{u}, \bar{d}, \bar{s}$ parton distributions can be
estimated. In this approximation, (D.17) becomes

$$F_2(x) \approx \frac{4}{9}\,x\,[u(x) + \bar{u}(x)] + \frac{1}{9}\,x\,[d(x) + \bar{d}(x) + s(x) + \bar{s}(x)], \qquad \text{(D.18)}$$

where $u(x) = p_u(x)$, etc.
Figure D.3 Curve 1 is of $x(u(x) - \bar{u}(x))$ (see equation (D.18)). $u(x) - \bar{u}(x)$ is
called the valence u quark distribution function. Curve 2 is of $x(d(x) - \bar{d}(x))$; $(d(x) - \bar{d}(x))$
is the valence d quark distribution function.

Curve 3 illustrates the sea quark distribution. Neglecting the generation of $c\bar{c}$,
$b\bar{b}$ and $t\bar{t}$ pairs, curve 3 is of $x(\bar{u}(x) + \bar{d}(x) + \bar{s}(x))$.


Figure D.3 shows acceptable sets of parton distributions for the proton at $Q^2 = 5\ \mathrm{GeV}^2$
and at $Q^2 = 10^4\ \mathrm{GeV}^2$. With the present precision of the data these curves can be taken
only as a fair indication of their forms. They have been constructed to satisfy the condition
that the total parton charge is equal to e:

$$\sum_i \int_0^1 q_i\,p_i(x)\,dx = 1,$$

but it is important to note that the charged partons carry only about one half of the total
proton momentum:

$$\sum_i \int_0^1 x\,p_i(x)\,dx \approx 1/2.$$

The remainder is presumably carried by the electrically neutral gluons.
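
The two conditions above can be checked numerically for any candidate parton set. The following is a minimal sketch, assuming a hypothetical toy set of valence distributions chosen so that both integrals are easy to verify by hand; the names `TOY_PARTONS`, `total_charge` and `momentum_fraction` are illustrative and are not taken from the text.

```python
# Numerical check of the charge and momentum sum rules for a toy parton set.
# The shapes are hypothetical; they are not the distributions of Fig. D.3.

def integrate(f, a=0.0, b=1.0, n=20000):
    """Midpoint-rule integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) for k in range(n))

# (charge in units of e, distribution p_i(x)) for the charged partons
TOY_PARTONS = [
    (+2.0 / 3.0, lambda x: 2.0 * 5.0 * (1.0 - x) ** 4),  # two valence u quarks
    (-1.0 / 3.0, lambda x: 1.0 * 5.0 * (1.0 - x) ** 4),  # one valence d quark
]

def total_charge(partons=TOY_PARTONS):
    """Sum over i of the integral of q_i p_i(x) dx."""
    return sum(q * integrate(p) for q, p in partons)

def momentum_fraction(partons=TOY_PARTONS):
    """Sum over i of the integral of x p_i(x) dx."""
    return sum(integrate(lambda x, p=p: x * p(x)) for _, p in partons)

if __name__ == "__main__":
    print("total charge      :", total_charge())
    print("momentum fraction :", momentum_fraction())
```

For this toy choice the charged partons carry unit charge and half of the momentum, which mimics the qualitative statement in the text; a realistic set would also include sea quarks and a gluon distribution.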
D.3 Hadronic states


The basic idea of the naive parton model is that at high $Q^2$ an electron scatters from a free
elementary quark or antiquark, and the scattering process is completed before the recoiling
quark has time to interact with its environment of quarks, antiquarks and gluons. Thus in
the calculation of the inclusive cross-section the final hadronic states do not appear.

In the model, at large $Q^2$ both the electron and the struck quark are deflected through
large angles. Figure 1.10 shows an example of an event from the ZEUS detector at HERA.
The transverse momentum of the scattered electron is balanced by a jet of hadrons that can
be associated with the recoiling quark. Another jet, the ‘proton remnant’ jet, is confined to
small angles with respect to the proton beam. Events like these give further strong support
to the parton model.

The ‘deep inelastic’ scattering data, when interpreted within the parton model, require
the nucleon to have some $\bar{u}$ and $\bar{d}$ content, and also to contain $s\bar{s}$ quark-antiquark pairs
(Fig. D.3). How is this to be reconciled with the simple quark model of nucleons at rest
that we used in Chapter 1? A quark of the ‘three quark’ model of a nucleon, often called a
constituent quark, is to be regarded as an elementary quark dressed with the strong
interaction field, which will itself induce fluctuating quark-antiquark pairs. The quarks in
the parton model are to be regarded as more like elementary quarks.

In quantum field theory, it is a non-trivial matter to make a Lorentz transformation on 
the internal wave function of a complex interacting system like a nucleon. The quark and 
gluon content of a proton are frame dependent. Because of time dilation, the time scale of 
the internal dynamics of the nucleon becomes long in a frame in which its momentum is 
large, and in this frame the parton distribution will be fixed over the time of interaction 
with an electron in a deep inelastic scattering experiment. The parton distributions in the 
model are taken to represent the distributions in this ‘infinite momentum’ frame. 

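Problems

D.1 Verify equations (D.2) and (D.3).

D.2 In quantum mechanics, the differential cross-section for the elastic scattering of an
electron with energy $E \gg m_e$ from a fixed electrostatic potential $\phi(\mathbf{r})$ is given in
Born approximation, and neglecting the effects of electron spin, by

$$\frac{d\sigma}{d\Omega} = \left(\frac{E}{2\pi}\right)^2 \left| e\int \phi(\mathbf{r})\,e^{i\mathbf{q}\cdot\mathbf{r}}\,d^3x \right|^2$$

where q is the difference between the initial and final wave vectors of the electron.

a. Show that $q = |\mathbf{q}| = 2E\sin(\theta/2)$, where $\theta$ is the scattering angle.

b. Poisson’s equation relates the potential $\phi(\mathbf{r})$ to the charge density $\rho(\mathbf{r})$ by $\nabla^2\phi = -\rho$.
Noting that $\nabla^2 e^{i\mathbf{q}\cdot\mathbf{r}} = -q^2 e^{i\mathbf{q}\cdot\mathbf{r}}$, and integrating by parts, show that

$$\frac{d\sigma}{d\Omega} = \left(\frac{E}{2\pi}\right)^2 \frac{e^2}{q^4} \left| \int \rho(\mathbf{r})\,e^{i\mathbf{q}\cdot\mathbf{r}}\,d^3x \right|^2.$$

Thus a measured cross-section can be used to infer the Fourier transform of the charge
distribution, as this simple example illustrates.

D.3 Taking $Q^2$ and $\nu$ as independent variables instead of $E'$ and $\theta$, show that

$$\frac{d^2\sigma}{dE'\,d\Omega} = \frac{1}{2\pi}\frac{d^2\sigma}{dE'\,d(\cos\theta)} = \frac{EE'}{\pi}\frac{d^2\sigma}{dQ^2\,d\nu}.$$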
Appendix E

Mass matrices and mixing


E.1 $K^0$ and $\bar{K}^0$


A phenomenological description of the time development of an electrically charged
meson $|P\rangle$ at rest is given by the equation

$$i\frac{d}{dt}|P\rangle = [m - (i/2)\Gamma]\,|P\rangle \qquad \text{(E.1)}$$

with its solution

$$|P(t)\rangle = |P(0)\rangle\,e^{-imt - (1/2)\Gamma t}.$$

Here, m is the meson mass, $\Gamma$ is the decay rate and $1/\Gamma$ is the mean life of the meson.

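Electrically neutral mesons, for example $K^0(d\bar{s})$ and $B^0(d\bar{b})$, which have a distinct
antimeson, in this example $\bar{K}^0(s\bar{d})$ and $\bar{B}^0(b\bar{d})$, can mix so that (E.1) becomes two coupled
equations. For $K^0$ and $\bar{K}^0$ these are

$$i\frac{d}{dt}\begin{pmatrix} |K^0\rangle \\ |\bar{K}^0\rangle \end{pmatrix} = \begin{pmatrix} m - (i/2)\Gamma & -p^2 \\ -q^2 & m - (i/2)\Gamma \end{pmatrix}\begin{pmatrix} |K^0\rangle \\ |\bar{K}^0\rangle \end{pmatrix}. \qquad \text{(E.2)}$$

$p^2$ and $q^2$ are two complex numbers. We can regard the 2 × 2 mass matrix as an
‘effective’ Hamiltonian $H_{\text{weak}}$. The equality of the diagonal elements of $H_{\text{weak}}$ is
guaranteed by CPT invariance. The weak interaction generates the off-diagonal elements

$$\langle K^0|H_{\text{weak}}|\bar{K}^0\rangle = -p^2, \qquad \langle \bar{K}^0|H_{\text{weak}}|K^0\rangle = -q^2.$$

Contributions to $p^2$ and $q^2$ are illustrated in Fig. E.1.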

By substitution into (E.2) it can be seen that the eigenstates of $H_{\text{weak}}$ are

$$|K_S\rangle = N\,[\,p\,|K^0\rangle + q\,|\bar{K}^0\rangle\,] \qquad \text{(E.3)}$$

and

$$|K_L\rangle = N\,[\,p\,|K^0\rangle - q\,|\bar{K}^0\rangle\,] \qquad \text{(E.4)}$$

with eigenvalues $m - i\Gamma/2 - pq$ and $m - i\Gamma/2 + pq$ respectively.

$N = (|p|^2 + |q|^2)^{-1/2}$ is a normalising factor. We choose the sign of the square root,
$pq = \sqrt{p^2q^2}$, so that Im(pq) is positive; then $K_L$ has a longer mean life than $K_S$.

The mass difference is $\Delta m = 2\,\mathrm{Re}(pq)$ (from experiment $\Delta m \approx 3 \times 10^{-12}$ MeV). We
shall identify m with the mean mass of $K_S$ and $K_L$. The mean lives are
Figure E.1 Quark diagrams illustrating how the weak interaction with W bosons
generates mixing. $q_i$ and $q_j$ are any of the (2/3)e charged quarks u, c or t. The
mixing matrix elements are proportional to the products of the four KM factors in
the diagrams.

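$$\tau_L = \frac{1}{\Gamma - 2\,\mathrm{Im}(pq)} \quad \text{and} \quad \tau_S = \frac{1}{\Gamma + 2\,\mathrm{Im}(pq)}$$

(from experiment $\tau_L \approx 5 \times 10^{-8}$ s, $\tau_S \approx 10^{-10}$ s). The subscripts L and S refer to the long and short lives.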
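
The eigenvalue statements above are easy to check numerically. The sketch below is a minimal illustration, assuming arbitrary made-up values for m, $\Gamma$, $p^2$ and $q^2$ (they are not physical kaon parameters); the helper names are hypothetical.

```python
import cmath

def h_weak(m, gamma, p2, q2):
    """2x2 effective Hamiltonian of (E.2) as nested tuples of complex numbers."""
    d = m - 0.5j * gamma
    return ((d, -p2), (-q2, d))

def apply(h, v):
    """Matrix-vector product for a 2x2 matrix h and a 2-component vector v."""
    return (h[0][0] * v[0] + h[0][1] * v[1],
            h[1][0] * v[0] + h[1][1] * v[1])

def eigen_system(m, gamma, p2, q2):
    """Return ((lambda_S, v_S), (lambda_L, v_L)) in the form of (E.3) and (E.4).

    The eigenvectors (p, q) and (p, -q) are left unnormalised.
    """
    p = cmath.sqrt(p2)
    q = cmath.sqrt(q2)
    if (p * q).imag < 0:          # sign convention of the text: Im(pq) > 0
        q = -q
    d = m - 0.5j * gamma
    return ((d - p * q, (p, q)), (d + p * q, (p, -q)))

def mean_life(eigenvalue):
    """A state evolving as exp(-i*lambda*t) decays with rate -2 Im(lambda)."""
    return -1.0 / (2.0 * eigenvalue.imag)

if __name__ == "__main__":
    # Illustrative numbers only, in arbitrary units.
    args = (1.0, 0.2, 0.01 + 0.002j, 0.009 + 0.003j)
    (lam_s, v_s), (lam_l, v_l) = eigen_system(*args)
    print("tau_S =", mean_life(lam_s), " tau_L =", mean_life(lam_l))
    print("mass difference =", (lam_l - lam_s).real)   # equals 2 Re(pq)
```

With the sign convention Im(pq) > 0 the second eigenvalue always has the smaller decay rate, which is why that state is labelled L.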

From lattice estimations of the bound state wave functions and other QCD
modifications, $p^2$ and $q^2$ can be calculated by perturbation theory in the weak interaction.
Fig. E.1 illustrates the fact that because some of the KM factors $V_{is}$, etc. are complex
numbers, p and q are not equal. As a consequence neither $|K_L\rangle$ nor $|K_S\rangle$ is an eigenstate of
CP. See Section 18.4.

E.2 $B^0$ and $\bar{B}^0$

The neutral B meson pair $B^0$ and $\bar{B}^0$ mix by the same mechanism as the neutral K mesons.
The parameters m, $\Gamma$, $p^2$ and $q^2$ take, of course, different values.

For the B pair Im(pq) is much smaller than $\Gamma$ so that the two mean lives are almost
equal. There are two particles of different mass:

$$|B_L\rangle = N\,[\,p\,|B^0\rangle + q\,|\bar{B}^0\rangle\,],$$
$$|B_H\rangle = N\,[\,p\,|B^0\rangle - q\,|\bar{B}^0\rangle\,].$$

The subscripts L and H refer to their masses: light and heavy.

For $B^0\bar{B}^0$ mixing it is a fortunate circumstance that the top quark, $q_i = t$, $q_j = t$, gives
the dominant contribution to $p^2$ and $q^2$. $p^2$ is proportional to $(V_{tb}V_{td}^*)^2$ and $q^2$ is
proportional to $(V_{tb}^*V_{td})^2$ (see Fig. E.1). Calculations result in the expressions



(E.5) 


(Donoghue et al., 1992, p. 395.) 

All other contributions are smaller by factors of $(m_c/m_t)^2$. $m_B$ is the B meson mass,
$f_B \approx 0.3$ GeV is its ‘leptonic decay constant’ and $F_{tt}$ is a dimensionless number, real to a
very good approximation.

With $F_{tt}$ real, Im(pq) = 0, and $B_L$ and $B_H$ have the same mean life. Within
experimental error this is seen to be so. Also $|p| = |q|$ and $p = |p|e^{i\beta}$, $q = |p|e^{-i\beta}$.
(See the unitarity triangle, Fig. 18.2.) Hence

$$|B_L\rangle = \frac{1}{\sqrt{2}}\left[e^{i\beta}|B^0\rangle + e^{-i\beta}|\bar{B}^0\rangle\right],$$
$$|B_H\rangle = \frac{1}{\sqrt{2}}\left[e^{i\beta}|B^0\rangle - e^{-i\beta}|\bar{B}^0\rangle\right].$$

A $B_L$ meson or a $B_H$ meson, at rest, develops independently with time:

$$|B_L(t)\rangle = |B_L(0)\rangle\,e^{-i(m - \Delta m/2)t - t/2\tau},$$
$$|B_H(t)\rangle = |B_H(0)\rangle\,e^{-i(m + \Delta m/2)t - t/2\tau}. \qquad \text{(E.6)}$$


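After some algebra it then follows that an initial $B^0$ or $\bar{B}^0$ develops in time into a mixture
denoted by

$$|B^0_{\text{phy}}(t)\rangle = \left[\cos\left(\frac{\Delta m\,t}{2}\right)|B^0\rangle + i e^{-2i\beta}\sin\left(\frac{\Delta m\,t}{2}\right)|\bar{B}^0\rangle\right] e^{-imt - t/2\tau},$$
$$|\bar{B}^0_{\text{phy}}(t)\rangle = \left[i e^{2i\beta}\sin\left(\frac{\Delta m\,t}{2}\right)|B^0\rangle + \cos\left(\frac{\Delta m\,t}{2}\right)|\bar{B}^0\rangle\right] e^{-imt - t/2\tau}. \qquad \text{(E.7)}$$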


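If the meson decays at time t to a final state $|f\rangle$ the decay amplitude for an initial $B^0$ will
be

$$\langle f|B^0_{\text{phy}}(t)\rangle = \left[\cos\left(\frac{\Delta m\,t}{2}\right)A_f + i e^{-2i\beta}\sin\left(\frac{\Delta m\,t}{2}\right)\bar{A}_f\right] e^{-imt - t/2\tau}$$

and for an initial $\bar{B}^0$

$$\langle f|\bar{B}^0_{\text{phy}}(t)\rangle = \left[i e^{2i\beta}\sin\left(\frac{\Delta m\,t}{2}\right)A_f + \cos\left(\frac{\Delta m\,t}{2}\right)\bar{A}_f\right] e^{-imt - t/2\tau}. \qquad \text{(E.8)}$$

$A_f = \langle f|B^0\rangle$ and $\bar{A}_f = \langle f|\bar{B}^0\rangle$ are the amplitudes for the decays $B^0 \to f$ and $\bar{B}^0 \to f$. If
the charge parity (CP) of f is +1 then it does not couple to the CP = −1 state $(B^0 - \bar{B}^0)$;
hence $A_f = \bar{A}_f$. The decay rates are then

$$\text{Rate}(B^0_{\text{phy}}(t) \to f) = |A_f|^2 e^{-t/\tau}\,[1 + \sin(2\beta)\sin(\Delta m\,t)],$$
$$\text{Rate}(\bar{B}^0_{\text{phy}}(t) \to f) = |A_f|^2 e^{-t/\tau}\,[1 - \sin(2\beta)\sin(\Delta m\,t)]. \qquad \text{(E.9)}$$

If f has CP = −1 the same expression results but with the + and − signs interchanged.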
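
The step from the amplitudes (E.8) to the rates (E.9) can be checked numerically. The following is a minimal sketch, assuming a CP = +1 final state (so the two amplitudes are equal) and arbitrary illustrative values of $\Delta m$, $\beta$ and $\tau$; the function names are hypothetical.

```python
import cmath
import math

def amplitude_B0(t, dm, beta, A_f=1.0, Abar_f=1.0):
    """Bracket of (E.8) for an initial B0; the common factor exp(-imt - t/2tau) is omitted."""
    half = 0.5 * dm * t
    return math.cos(half) * A_f + 1j * cmath.exp(-2j * beta) * math.sin(half) * Abar_f

def amplitude_B0bar(t, dm, beta, A_f=1.0, Abar_f=1.0):
    """Bracket of (E.8) for an initial B0-bar; the same common factor is omitted."""
    half = 0.5 * dm * t
    return 1j * cmath.exp(2j * beta) * math.sin(half) * A_f + math.cos(half) * Abar_f

def rates(t, dm, beta, tau):
    """Return (rate for initial B0, rate for initial B0-bar) with A_f = Abar_f = 1."""
    damping = math.exp(-t / tau)          # |exp(-imt - t/2tau)|^2
    return (abs(amplitude_B0(t, dm, beta)) ** 2 * damping,
            abs(amplitude_B0bar(t, dm, beta)) ** 2 * damping)

def asymmetry(t, dm, beta, tau=1.0):
    """(rate_B0 - rate_B0bar) / (rate_B0 + rate_B0bar)."""
    r, rbar = rates(t, dm, beta, tau)
    return (r - rbar) / (r + rbar)

if __name__ == "__main__":
    t, dm, beta = 0.7, 0.5, 0.4           # illustrative values, arbitrary units
    print(asymmetry(t, dm, beta), math.sin(2 * beta) * math.sin(dm * t))
```

The exponential damping cancels in the asymmetry, so a fit to its time dependence depends only on $\Delta m$ and $\sin 2\beta$.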

At CLEO, BaBar and Belle, $B^0$ and $\bar{B}^0$ mesons are produced in pairs. If one undergoes a
leptonic decay with a negative charge lepton it must have been a $\bar{B}^0$; its partner at that
instant is a $B^0$, and it is the time dependence of this second decay that is measured.

Similarly a positive charge lepton identifies a $B^0$ decay that leaves its partner an initial
$\bar{B}^0$. This procedure is called tagging. The mass difference $\Delta m$ and $\sin 2\beta$ are measured by
tracking the time dependence of tagged mesons.

The formulae for $p^2$ and $q^2$ for $K^0$, $\bar{K}^0$ follow the same pattern as for B decays but the
top quark contributions are highly suppressed by very small KM factors; c and u quarks
contribute significantly and the simplicity for B mesons is lost.
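Hints to selected problems


Chapter 2

2.1 $a'_\mu = g_{\mu\rho}a'^\rho = g_{\mu\rho}L^\rho{}_\lambda a^\lambda = g_{\mu\rho}L^\rho{}_\lambda g^{\lambda\nu}a_\nu$. Hence $a'_\mu = L_\mu{}^\nu a_\nu$ where $L_\mu{}^\nu =
g_{\mu\rho}L^\rho{}_\lambda g^{\lambda\nu}$. In particular, $L_0{}^i = g_{00}L^0{}_j g^{ji} = -L^0{}_i$.

2.2 $a'^\rho = L^\rho{}_\nu a^\nu$. Multiply on the left by $L_\rho{}^\mu$: $L_\rho{}^\mu a'^\rho = L_\rho{}^\mu L^\rho{}_\nu a^\nu = a^\mu$, or $a^\mu =
a'^\nu L_\nu{}^\mu$. Similarly, $a_\mu = a'_\nu L^\nu{}_\mu$.

2.3 $d\phi = \frac{\partial\phi}{\partial x^\mu}dx^\mu = \frac{\partial\phi}{\partial x'^\nu}dx'^\nu = \frac{\partial\phi}{\partial x'^\nu}L^\nu{}_\mu\,dx^\mu$. Since the $dx^\mu$ are arbitrary,

$$\frac{\partial\phi}{\partial x^\mu} = \frac{\partial\phi}{\partial x'^\nu}L^\nu{}_\mu.$$

This is a covariant vector field transformation (Problem 2.2).

$$\det(L_\mu{}^\nu) = \det(g_{\mu\rho})\det(L^\rho{}_\lambda)\det(g^{\lambda\nu}) = (-1)^2\det(L^\rho{}_\lambda).$$

From (2.14), $\det(L_\rho{}^\mu)\det(L^\rho{}_\nu) = \det(\delta^\mu_\nu) = 1$. The result follows.

2.6 Note that if $\det L_1 = 1$ and $\det L_2 = 1$ then $\det L_1 \det L_2 = 1$.

2.7 $\delta'^\mu_\nu = L^\mu{}_\rho L_\nu{}^\lambda\delta^\rho_\lambda = L^\mu{}_\rho L_\nu{}^\rho = \delta^\mu_\nu$ using Problem 2.2.

2.8 Using (2.3), $\omega' = \omega\cosh\theta - k\sinh\theta$
$= \omega(\cosh\theta - \sinh\theta)$ since $\omega = k$
$= e^{-\theta}\omega$.

Since $v/c = \tanh\theta$, the result follows.

2.9 Jacobian is $\det(\partial x'^\mu/\partial x^\nu) = \det(L^\mu{}_\nu) = 1$.

2.10 The operation of space inversion can be written as $x'^\mu = P^\mu{}_\nu x^\nu$. Then the tensor
$\varepsilon_{\mu\nu\lambda\rho}$ transforms as

$$\varepsilon'_{\mu\nu\lambda\rho} = P_\mu{}^\alpha P_\nu{}^\beta P_\lambda{}^\gamma P_\rho{}^\delta\,\varepsilon_{\alpha\beta\gamma\delta} = \varepsilon_{\mu\nu\lambda\rho}\det P = -\varepsilon_{\mu\nu\lambda\rho}.$$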
Chapter 3

3.1 Let $x_i$ (i = 1, ..., 3N) be the Cartesian coordinates of the particles. Since $x_i =
x_i(q)$, $\dot{x}_i = (\partial x_i/\partial q_j)\dot{q}_j$. Then $T = (m/2)\dot{x}_i\dot{x}_i = (m/2)(\partial x_i/\partial q_j)(\partial x_i/\partial q_k)\dot{q}_j\dot{q}_k$.


3.2 


d E 
dt 


-f 


, d (d£ 


dt \d<p 


d£ ■■ d £.. d£., 


</>— — + -^-</>' 


d <p' _ 


dx. 


Integrate by parts the term —(9£/9</>')(90/9x) and use (3.12). 

3.4 Use orthogonality and the dispersion relation (3.20). Note that H and $P^i$ form a
contravariant four-vector (H, P).


3.5 Varying i/r*, 


SS = I S£dt d 3 x 


-/ 

-/ 


, Jl/r 9(9l/r*) 

- (1/2 H a * * - -IT 2 * 

■ (l/2wi)V(«5t/r*) ■ Vi/c - <5i/r* Vi/r] dr d ; 


x. 


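Integrating by parts the terms involving $\partial(\delta\psi^*)/\partial t$ and $\nabla(\delta\psi^*)$ gives

$$\delta S = \int \left[-(1/i)\frac{\partial\psi}{\partial t} + (1/2m)\nabla^2\psi - V\psi\right]\delta\psi^*\,dt\,d^3x.$$

Since this is true for any $\delta\psi^*$, the integrand must vanish. Hence

$$i\frac{\partial\psi}{\partial t} = -(1/2m)\nabla^2\psi + V\psi.$$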


Chapter 4 

4.1 $\mathcal{L} = -(1/4)F_{\mu\nu}F^{\mu\nu} - J^\mu A_\mu$. From (4.16), $F^{01} = -E_x = -F_{01}$, $F^{12} = -B_z = F_{12}$,
etc.

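4.2 $\mathbf{A} \to \mathbf{A}' = \mathbf{A} - \nabla\chi$. We require $\nabla\cdot\mathbf{A}' = \nabla\cdot(\mathbf{A} - \nabla\chi) = f - \nabla^2\chi = 0$. The solution is

$$\chi(\mathbf{r}, t) = -\frac{1}{4\pi}\int \frac{f(\mathbf{r}', t)}{|\mathbf{r} - \mathbf{r}'|}\,d^3\mathbf{r}'.$$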


4.3 


4.4 


Foi — (eomF~ 3 + eoi32C 3 “)/2 
7 23 7/32% 


A = 


= {F li - F iz )/2 = (~B X - B x )/2 = -B x , etc. 

1 


s/2wV 

1 

V2«V 


[(£, + ie,)e i(fe -“ f) + (e x - i£ v )e- ifc - w,) ] 


[2 cos(£z — &>r), —2 sin(A:z — cur), 0], 


9 A 2co 

E = — — = y — [sin(a>r — Lz), — cos(&>r — Lz), 0]. 


By inspection, on any plane of fixed z, E rotates in a positive sense about the z-axis. 
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4.5 If the fields vanish at infinity, a term 3/(AoF 0 ') = 3 M (Ao,F 0p ) does not contribute to 
the energy. Thus the energy density is not unique, and we may take 

= -F 0p 3 0 A M + 3 /t (A 0 F 0/i ) + 

= -F° m OoA m - 3 m A 0 ) + ^F tlv F ,iV , 

since in free space d^F 0 ^ — 0 by (4.8), 

= -F**F 0fl +^F llv F' iV . 

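4.6 $L = \frac{1}{2}m\dot{\mathbf{x}}^2 - q\phi + q\dot{\mathbf{x}}\cdot\mathbf{A}$; $p^i = (\partial L/\partial\dot{x}^i) = m\dot{x}^i + qA^i$ are the generalised
momenta. The equation of motion $(dp^i/dt) = (\partial L/\partial x^i)$ is

$$m\ddot{x}^i + q(\partial A^i/\partial t) + q(\partial A^i/\partial x^j)\dot{x}^j = -q(\partial\phi/\partial x^i) + q\dot{x}^j(\partial A^j/\partial x^i),$$

giving

$$m\ddot{x}^i = q[-(\partial\phi/\partial x^i) - (\partial A^i/\partial t)] - qF^{ij}\dot{x}^j$$

(noting $\partial^i = -\partial/\partial x^i$, and definition (4.6)). Taking i = 1,

$$m\ddot{x} = q(E_x - F^{12}\dot{y} - F^{13}\dot{z}) = q(E_x + \dot{y}B_z - \dot{z}B_y),$$

and similarly for the other components.

$$H(\mathbf{p}, \mathbf{x}) = p^i\dot{x}^i - L = \mathbf{p}\cdot(\mathbf{p} - q\mathbf{A})/m - [(\mathbf{p} - q\mathbf{A})^2/2m - q\phi + q(\mathbf{p} - q\mathbf{A})\cdot\mathbf{A}/m] = (\mathbf{p} - q\mathbf{A})^2/2m + q\phi.$$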

4.7 $\int L\,dt = \int (\gamma L)\,d\tau$, where $d\tau = dt/\gamma$ is Lorentz invariant (see (2.5); $\tau$ is the ‘proper
time’). Hence the result.


Chapter 5 

5.3 Under the transformations (5.19) and (5.20), 

V^rVl = iArN+M \p L = iArVt. 

VtVr = iAlM+NiAr = i/tVr, 

'Ar^V'r = V^rN^NiAr = L /M v ip R (x v \p R , 
iAlCT^iAl = ipl/M* v \j/^ h G v \p R , 

iArCt m ctVl = 'AlM t cr /i MN t o- v NiAL (sinceMN f = I) 

= L M x L%iAj?<^CT p iA L , etc. 

5.4 Using (5.28), (5.31) becomes

$\psi^\dagger\beta\,(i\beta\partial_0 + i\beta\alpha_i\partial_i - m)\,\psi = \psi^\dagger\,(i\partial_0 + i\alpha_i\partial_i - \beta m)\,\psi$ since $\beta^2 = I$.
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5.6 


i $Y s t = i(^L.^R)( ® 0 o )( 



This is invariant under proper Lorentz transformations, but changes sign under the 
parity operation (5.27). 


5.7 The results follow from the definitions (5.30) and (5.4). 


Chapter 6

*+*+= l ~((+ 1 e-*/ 2 , (+1 e e/2 ) ( e e ;;| 2 ;; } ) 

= ^[e- 0 (+1+) + e fl (+|+)] 

= cosh0 = y = E/m. 

From (6.14), probability of right-handed mode 

e e e B 1 / v\ u 

= — ; -tt = - = - 1 + - , since tanh 9 — 

e 6 + e~ B 2cosh 9 2 \ c) c 

6.3 n^(p)w + (p) = \{e e + e~ e ) = cosh 0 = E/m, etc. 

w+(p)w-(p) = 0 since (+| —) = 0. 

Note that 

ct-pI + > = |+> and <r-p|-)=-]-) 

implies 

o - -(-p)l+>= - l + > and <r-(-p)| - ) = |-). 

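6.5 $|+\rangle$ and $|-\rangle$ are evidently normalised, and by direct substitution and the use of
trigonometric identities, $\boldsymbol{\sigma}\cdot\hat{\mathbf{p}}\,|+\rangle = |+\rangle$, $\boldsymbol{\sigma}\cdot\hat{\mathbf{p}}\,|-\rangle = -|-\rangle$.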


Chapter 7 

7.1 This follows using the orthogonality properties of plane waves and those derived in 
Problem 6.3. 


7.2 For example, 

i// c + = -iy 2 i/i* = (i/V2je- Kpz - E,) ^ 
ander 2 |+) = i |—), giving 

f c + = (l/V2 y~ i(pz - E 'l 


0 —a 2 \ / Q~ e ! 2 


cr 2 0 


1+) \ 
• 6I2 I +)) 


e 0/2 |-> 

- e -«/ 2 I -) 
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7.3 Under the parity operation. 


iAl -* Vfit, 5^3^ cr" 3 M , 
from (5.26) and (5.27). Under charge conjugation, 

Vfir ier 2 ^. 

Hence under the combined operations, 

iy^cr^Vh. -> v^1a 2 a ll a 2 d ll fl = -i3 M ^(a 2 cr' i o- 2 ) T i/f L 


(recall the - sign that must be introduced when spinor fields are interchanged). But 

(ct 2 <7 M CT 2 ) T = 

Finally, integrating by parts in the action yields the Lagrangian density ii/r^CT'' 3,, i//| . 

7.4 V/R = NiA r by (5.20). 

icrV* -> icr 2 N*iA*. 

But er 2 N* = Mct 2 . This is true for M and N given by (5.24), and holds in general. 

7.5 Varying <t> in the action gives 

.s-/(-[«. + *W»-]K .A*-*) d,* 

= / <5$*{(i3„ - q A M )(i3 M - q A»)<f> - m 2 $} d t d 3 x, 


after integrating by parts. Since this holds for any c5<J>*, the Klein-Gordon equation 
follows. 

7.6 If $ —»■ e‘“4> with a = a(x) small, 


(i3„ + qA^*) = e‘“(i3 M + qAJ* - (3 M a)e‘“3> 

SS = I {-(a /1 a)$*[(ia' t - qA^] + [(i3^ + qA^ns^a)®} d t d 3 x 


/ 


= / a(x)3 M {$*[(i3 ,t - qA 11 )®] - [(i3 M + q A")$*]$} dr d 3 x, 


after integrating by parts. Hence the current 

= i[$*(3^$) - (3"$*)$] - 2 qA^*^> 

is conserved, as is also qj ,J . (Note that qj 11 = —d£/dA^ is the electromagnetic 
current.) 
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7.7 Verify by direct calculation, e.g. for positive helicity and taking p, = 3, 
qj 2 = —ei/f + y°y 3 i/f 


0 


-e /2 


1 +) 


r 0/2 


1 +) 


= - (e/2) (e-^ 2 <+|,e 9/2 <+|) 

= — e sinh 9, sincecr 3 |+) = | +>. 

7.8 This follows since the electric field lines are reversed in direction, E —* E' = 


-E. 


7.9 Assuming p(t) —► p'(t') = p(— t). Maxwell’s equations retain the same form if E 
E' = E, B —> B' = —B, J -* J' = — J, or equivalently 

4> ^ 4>' = </>, A —> A' = —A. 

Taking the complex conjugate of (7.6) and multiplying on the left by y l y 3 gives 

Y X y\y v * = 0- 

Now 

J / 1 / 3 (y 0 )* = y 1 y 3 y° = y°y 1 y 3 , 

Y 1 Y 3 (y 1 )* = ~Y I Y 1 Y 3 for; = 1,2, 3, 
and the result follows. 


Chapter 8 

8.3 If an $e^+e^-$ pair is created there is a frame of reference (the centre of mass frame)
in which the total momentum of the pair is zero. The photon would also have zero
momentum in this frame and hence zero energy: energy conservation would be violated.


Chapter 9 

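9.1 Conservation of energy gives $m_\pi = E_e + E_\nu$. Conservation of momentum gives $p_e =
p_\nu$. Also

$$E_\nu = p_\nu, \qquad E_e^2 = p_e^2 + m_e^2, \qquad v_e = p_e/E_e.$$

Hence

$$(m_\pi - p_e)^2 = E_e^2 = p_e^2 + m_e^2,$$

$$p_e = \frac{m_\pi^2 - m_e^2}{2m_\pi}, \qquad E_e = m_\pi - p_e = \frac{m_\pi^2 + m_e^2}{2m_\pi},$$

$$\frac{1}{2}\left(1 - \frac{v_e}{c}\right) = \frac{1}{2}\left(1 - \frac{m_\pi^2 - m_e^2}{m_\pi^2 + m_e^2}\right) = \frac{m_e^2}{m_\pi^2 + m_e^2}.$$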
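
The two-body kinematics of this hint can be checked numerically. The sketch below is an illustration only; the mass values used in the example call are standard approximate values supplied here as assumptions (they are not quoted in this hint), and the function name is hypothetical.

```python
def pion_decay_kinematics(m_pi, m_e):
    """Electron momentum, energy and speed (in units of c) for pi -> e + nu at rest.

    The neutrino is taken to be massless, so E_nu = p_nu = p_e.
    """
    p_e = (m_pi ** 2 - m_e ** 2) / (2.0 * m_pi)
    E_e = (m_pi ** 2 + m_e ** 2) / (2.0 * m_pi)
    v_e = p_e / E_e
    return p_e, E_e, v_e

if __name__ == "__main__":
    # Assumed approximate masses in MeV (not taken from the text).
    m_pi, m_e = 139.57, 0.511
    p_e, E_e, v_e = pion_decay_kinematics(m_pi, m_e)
    print("p_e =", p_e, "MeV, E_e =", E_e, "MeV")
    print("(1/2)(1 - v_e/c) =", 0.5 * (1.0 - v_e))
```

The last printed quantity is the small helicity-suppression factor $m_e^2/(m_\pi^2 + m_e^2)$ derived in the hint.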
9.2 Final energy $E = E_e + E_\nu = E_e + p_e$,

$$\frac{dE}{dp_e} = \frac{dE_e}{dp_e} + 1 = \frac{p_e}{E_e} + 1 = \frac{E_e + p_e}{E_e} = \frac{m_\pi}{E_e}.$$

9.3 Using Problem (9.1), 


0-t) 


- PcE e 


4/77 3 


/ 2 2\ 2 
= ytn w -m e ) , 


with a similar expression for the p leptons. 


9.4 Since the pion is at rest, only the term 34>/3r contributes. From (3.35), there is a 
factor in £ int arising from this: 

1 (-i m n ) 




rfl 0 - 


From Problem 6.5, the v factor is 


1 d* e i(- P'- r ) |_\ , 


From (6.24), the e\ factor is 


VvUr P V2 (+lp - 


(Only this helicity term contributes.) 

Integrating over volume gives p' = — p and a volume factor V, so that, for a 
given p, 

(-i) [m^ fm7 a* __ e/2 


(Note that $|-\rangle_{-\mathbf{p}} = |+\rangle_{\mathbf{p}}$.)

Hence the transition rate s is obtained. The factor $4\pi$ in the density of states comes
from summing over all directions of p. Also $(E_e/m_e) = \cosh\theta$ and $e^{-\theta}/\cosh\theta =
(1 - \tanh\theta) = (1 - v/c)$.

9.7 $$G_F = \left(\frac{192\pi^3}{\tau m_\mu^5}\right)^{1/2} = 1.164 \times 10^{-5}\ (\mathrm{GeV})^{-2}.$$


9.8 The square of the centre of mass energy

$$s = (E_e + E_\nu)^2 - (\mathbf{p}_e + \mathbf{p}_\nu)^2$$

is Lorentz invariant. In the electron’s rest frame

$$s = (m_e + E_\nu)^2 - p_\nu^2 = m_e^2 + 2m_eE_\nu.$$

9.9 The expression (9.8) contains the term 

+ ""‘VeLV^trCL. 


-2V2 G F g„ v el^ . f 
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9.10 


The expression (9.15) contains the term 

(^G F /V2jg lll ,vl iL d IM v llL ^ e y v (c v - c A y 5 )ir e . 

t(K-+ |xv^) w 2 (m| - ml ) 2 


_ _ K _ 

t (K —> e"V e ) (ot^ — m 2 ) 2 

2 


= 2.57 x 10“ 


1 


r (K (x-v M ) 4-tt 


«K / U|i\ 2 F 


(Cf. (9.3)), 


where (l - —) m K 2 - w^ 2 ) 2 

x C / 4/?2 K ~ 

(cf. Problem 9.3). 

This gives ax = 5.82 x 10“ 10 MeV -1 , and a n = 2.09 x 10 9 (text), giving 
a K /a n = 0.28. 


9.11 Consider the decay r —*■ n + v T . The term in £ int that generates the decay is 

v\ l g^ r L 9^ f . 


Consider the r to be at rest with its spin aligned along the 2 -axis, and the neutrino 
momentum to be p. The pion momentum is then (— p), and the interaction energy 
contains a term 

5fvS 4( “ p)i ’ !(p,i ’' (0,H ' (<r ” £ '-‘ ,p) 7l(i)- 


Now (— l p (a°E 7r — erp) = (— | p (E n + p y ) = (—| m T , and from Problem 6.5, 
(— | p = (— sin(0/2)e'^, cos(0/2)) where 0 and <j> are the polar angles of p. 
Hence 

(jr_„ v„|l/| t) = si n (( 9 /2) e i*. 

The decay rate is 

X -=2n I \(f\V\i)\ 2 p(mjdtt 


where 


p(m T ) 


V (m 2 - m n 2 y E n 
(27r) 3 4m T 2 m T ’ 


and the angular integration gives a factor 27t. 


Chapter 10 

10.1 The term —(777 2 /2</>o 2 )\/2</>oxV f2 links the / and i J/ fields, and m = m x /s/2. Since 
the t/t particles are massless, the final energy E = 2 p, and the density of states factor 
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for the decay is 


P(E) = 


V ~ dp 
(27r) 3 4jlp d E 


dp 1 
where — = 
d E 2 


and the factor 4 tt comes from the angular integration. 

In the matrix element (p, — p | V | / at rest), the y field gives a factor 1 / J2m x from 
the expansion (3.21), and each of the i j/ fields gives a factor l/^/2p. Hence 

- 4 1 1 4itp 2 1 


2tt\{tp\V\i)\ p{E) = 2 tt - 


_ "H 

1287r \ (po 


2m x 4p 2 (27r) 3 2 


10.2 


The decay of an isolated vector boson requires a term in £ int linear in ,4 /( . There is 
a term A M /i that allows the decay of the scalar boson if energy conser¬ 


vation can be satisfied, i.e. ni/, — *j2m 



Chapter 11 

11.1 The term $UWU^\dagger$ satisfies $(UWU^\dagger)^\dagger = UWU^\dagger$ and $\mathrm{Tr}(UWU^\dagger) =
\mathrm{Tr}(U^\dagger UW) = \mathrm{Tr}(W) = 0$.

Noting that (a ■ r) 2 = I and (3 / ,u ,/ )ci' / = (J since« , a' / = I, the term 
(2 i /g2)(3/iU)U' may be written as a linear combination of the matrices r-' 
with real coefficients. Each r 7 is Hermitian and has zero trace. 

11.3 The last term may be written as $(g_2^2\phi_0^2/4)(W^1_\mu W^{1\mu} + W^2_\mu W^{2\mu})$, and in the absence
of electromagnetic fields the term that precedes it can be handled similarly. There
are therefore two independent fields each with mass $g_2\phi_0/\sqrt{2}$ (cf. Section 4.9).

11.4 The interaction Lagrangian density (11.32) contains a term gy 2 /\/2)hW' W +jl cou¬ 
pling the h field and the charged W fields. 


h 
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11.5 Consider 


Then 


and 


using 


Hence 


U = cosal + i sin ax • a (seeB.9). 


. it 0 0 ^ ^ 

U = cosal — isina(r a — r a + r a) 


r 2 U* = [cosal + isina(r 1 a 1 + r z a z + r z a z )]r 


1-1 , _ 2-2 , ^ 3 a 3\ 1t 2 


2 1 1 2 2 3 3 ? 

r t = — t r, r r = —r r . 


ir 2 U* = U(ir z ) and it 2 = 


0 1 

-1 0 


The result follows. 
11.6 Using (B.9). 


U = cos al + sin a(sin <pr 1 + cos </>r 2 ) 

" cos a i sin a(sin 0 — i cos </>) 

isina(sin0 + icos0) cos a 


Chapter 12 

12.2 Take the two fields to be 



To maintain local gauge invariance, the dynamical term in the Lagrangian density 
must be l2 <7^4(9^ + i(g 2 /2)W At )L. 

There are terms which mix L\ and Li, for example, 

-(g 2 /2)Li t ff'‘0IV I -iW^ 2 )L2 

= -(g2/2)L^^L 2 W^. 

The operator W^ destroys electric charge <?, so that to conserve charge L | a 1 ' L 2 , 
must create charge e. 


12.3 


The Higgs particle at rest has zero momentum and zero angular momentum. Hence
the $e^+$ and $e^-$ have opposite momentum. If they had opposite helicities, they would
have to carry orbital angular momentum with a component +1 or −1 along their
direction of motion, to conserve angular momentum. This is not possible since
$\mathbf{p}\cdot(\mathbf{r}\times\mathbf{p}) = 0$.

The final density of momentum states is 


P(E) = 


V 


Anp e 2 


d p e 

d E ' 
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The final energy E = 2E e ,where E 2 
d p e _ 1 d p e _ E e 
d E 2d E e 2 p e ' 


= m e 2 p e 2 . Hence 

v 

and p(E)=——rp e E e . 
(2 n) 2 


The interaction term in (12.9) is — (c e V 2 )h\ln[r . From (6.24) and (3.21), this gives 

or 


[/x_(p)n_(-p)]. 

Now/x±(p)i>±(—p) = sinh 0, and E e /m e = cosh 6. Hence the decay rate to positive 
helicities is 

2n\{f\V\i)\ 2 p{E) = tanh 2 6-^— Pe E e . 

2 (27T ) z 

Also tan 6 = v e /c — p e /E e and E e = mu/2. The decay rate to negative helicities is 
the same, and the result follows. 

12.4 Since $c_\tau > c_\mu > c_e$ (see (12.13)) the decay to $\tau^+\tau^-$ dominates in the leptonic partial
width. Also, since the Higgs mass is much greater than the $\tau$ mass, $v_\tau \approx c$. Hence

$$\frac{\Gamma}{m_H} \approx \frac{c_\tau^2}{16\pi} = \frac{1}{16\pi}\left(\frac{m_\tau}{\phi_0}\right)^2.$$

Chapter 13 

13.1 In the rest frame of the W, and neglecting the lepton mass, $\mathbf{p}_l = -\mathbf{p}_\nu$, $E_l = p_l =
M_W/2$, and $p_l^2 = M_W^2/4 = p_x^2 + p_y^2 + p_z^2$. Taking the x-axis to be the beam direction,
the mean square transverse momentum is

$$\overline{p_y^2} + \overline{p_z^2} = (2/3)\,p_l^2 = M_W^2/6.$$

13.2 From (12.23), the $Z_\mu$ is produced by right-handed electron fields with a coupling
$e\tan\theta_w = 2e\sin^2\theta_w/\sin(2\theta_w)$ and by left-handed fields with a coupling
$-e\cos(2\theta_w)/\sin(2\theta_w)$. In head-on collisions at high energies the right-handed
component of the electron (positron) has positive (negative) helicity. Hence the total spin
is +1 along the electron beam direction. The spin of the left-handed components is
opposite. For unpolarised beams the left-handed and right-handed components are
equally populated, and the result follows.

13.3 Consider the decay $W^- \to e^- + \bar{\nu}_e$ in the $W^-$ rest frame. With no loss of generality
we may take the W to have J = 1, $J_z = 0$ (see Section 4.9). The interaction
Lagrangian density responsible for the decay is (from (12.15) and (12.16))

£=-(g 2 /V2)j 2 Wi. 




If the electron has momentum p, the neutrino has momentum —p. Neglecting the 
electron mass (see Problem 6.5) the matrix element for the decay is 

(/| V \i) = -%-^== (“I 0-3 l+> ■ 

V2 V2 M W V 

(Recall cr-p |—) = — |—), cr ■ (—p) |+) = — ]+}.) Also, from Problem 6.6, 
(—| cr 3 |+) = — sinde 1 ^. The decay rate is 




V 


3 9 


</l viordn— 3 pS-£ 


(2tt) 


d E 


, by (12.22). 


where $dp_e/dE = 1/2$, $p_e = M_W/2$, giving

$$\Gamma = \frac{g_2^2}{48\pi}M_W = \frac{G_F M_W^3}{6\pi\sqrt{2}}.$$

The decay rate for $Z \to \nu\bar{\nu}$ requires a similar calculation, with $M_W$ replaced
by $M_Z$ and the coupling constant $g_2/\sqrt{2}$ replaced by $e/\sin 2\theta_w = g_2/2\cos\theta_w =
g_2M_Z/2M_W$. (We have used (12.23), (11.38) and (11.37a).) Then

$$\Gamma(Z \to \nu\bar{\nu}) = \frac{G_F M_Z^3}{12\pi\sqrt{2}}.$$

There are two terms in (12.23) contributing to $\Gamma(Z \to e^+e^-)$, yielding
$\Gamma(Z \to e^+e^-) = \Gamma(Z \to \nu\bar{\nu})\,[(2\sin^2\theta_w)^2 + (\cos 2\theta_w)^2]$.
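
The tree-level widths quoted in this hint are simple to evaluate. The sketch below is a hedged illustration: the function names are hypothetical, and the numerical inputs in the example call (the Z mass and $\sin^2\theta_w$) are assumed round values supplied here, not values taken from this hint; only $G_F$ follows the value quoted in the hint to Problem 9.7.

```python
import math

def gamma_W_to_e_nu(G_F, M_W):
    """Gamma(W -> e nu) = G_F M_W^3 / (6 pi sqrt 2), natural units."""
    return G_F * M_W ** 3 / (6.0 * math.pi * math.sqrt(2.0))

def gamma_Z_to_nu_nu(G_F, M_Z):
    """Gamma(Z -> nu nubar) = G_F M_Z^3 / (12 pi sqrt 2), natural units."""
    return G_F * M_Z ** 3 / (12.0 * math.pi * math.sqrt(2.0))

def gamma_Z_to_e_e(G_F, M_Z, sin2_theta_w):
    """Gamma(Z -> e+ e-) = Gamma(Z -> nu nubar) [(2 sin^2)^2 + (cos 2 theta)^2]."""
    cos_2theta = 1.0 - 2.0 * sin2_theta_w       # cos(2 theta) = 1 - 2 sin^2(theta)
    factor = (2.0 * sin2_theta_w) ** 2 + cos_2theta ** 2
    return gamma_Z_to_nu_nu(G_F, M_Z) * factor

if __name__ == "__main__":
    G_F = 1.164e-5            # GeV^-2, as quoted in the hint to Problem 9.7
    M_Z = 91.2                # GeV, assumed illustrative value
    sin2_theta_w = 0.23       # assumed illustrative value
    print("Gamma(Z -> nu nubar) =", 1e3 * gamma_Z_to_nu_nu(G_F, M_Z), "MeV")
    print("Gamma(Z -> e+ e-)    =", 1e3 * gamma_Z_to_e_e(G_F, M_Z, sin2_theta_w), "MeV")
```

The result for the charged-lepton width is sensitive to the inputs chosen, so small differences from a quoted value should be expected when different values of $M_Z$ or $\sin^2\theta_w$ are used.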


13.4 83.86 MeV.


Chapter 14 

14.3 Under an SU(2) transformation, and from Appendix A.2 
(ch T eL) -» (<t> T U T eUL) 


"Uaa 

Uba 

0 

r 

"Uaa 

Uab 


0 

Det(U)" 

_ Uab 

Ubb . 

-1 

0_ 

. Uba 

Ubb _ 


-Det(U) 

0 


= (Det(t/))e 

= e, since Det(U) = 1. Hence (4> T t/ T et/L) = (4> T eL) 

14.4 From (11.23), 

4> = ( ° r \. 

\<h +h/V 2 ) 

Inserting this in (14.6) gives the coupling terms 

-(1/V2)^[ Gfjdld Rj h + Hermitian conjugate. 

Similar terms arise from (14.9) and (14.10). Using the true quark masses these 
become 

- (1/V20o) J>f(4<4i + d^du) + m u i (u { u um + m^wli)]/?. 
The coupling to the top quark is

$$\frac{m_t}{\sqrt{2}\,\phi_0} = \frac{180\ \mathrm{GeV}}{\sqrt{2} \times 180\ \mathrm{GeV}}.$$

14.5 For K + —*■ /x + + v M , the terms 


from 7 ^, from j* 


contribute in the second order of perturbation theory. ( See (a).) 



d cl 




For D + K° + e + + v e , 

s^o-^clV* from y M , from j' 1 '. (See (b).) 

For Bt D° + jrt, 

b'id ll c-LV* h from / M , from j' 4 . (See (c).) 
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14.6 




The quark labelled u, can be u, c or I 


Chapter 15 

15.1 The decay rate for $Z \to d\bar{d}$ of (15.3) can be compared with the decay rate for
$Z \to e^+e^-$ of (13.3), calculated in the answer to Problem 13.3. Comparing the interaction
Lagrangian densities (12.23) and (14.14), the term in the left-handed coupling
$\cos 2\theta_W = 1 - 2\sin^2\theta_W$ is replaced by $(1 - (2/3)\sin^2\theta_W)$, and in the right-handed
coupling $2\sin^2\theta_W$ is replaced by $(2/3)\sin^2\theta_W$. Including a colour factor of 3 and
replacing $\sin^2\theta_W$ by $(1/3)\sin^2\theta_W$ in the rate (13.3) gives the rate (15.3).

Similarly for $Z \to u\bar{u}$. Comparing (12.23) with (14.14), $\sin^2\theta_W$ is replaced by
$(2/3)\sin^2\theta_W$.

The decay rate $W^+ \to u_i\bar{d}_j$ of (15.6) can be compared with the rate $W^+ \to e^+\nu_e$
of (13.2) calculated in the answer to Problem 13.3. Comparing the interactions
(12.18) and (14.20), $g_2/\sqrt{2}$ is replaced by $eV_{ij}/(\sqrt{2}\sin\theta_W) = g_2 V_{ij}/\sqrt{2}$. Including
the colour factor of 3, the rate (15.6) follows from the rate (13.2).
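
The replacement rule described in 15.1 can be turned into a short numerical sketch. It assumes the tree-level rate $\Gamma(Z \to \nu\bar{\nu}) = G_F M_Z^3/(12\pi\sqrt{2})$ from the answer to Problem 13.3 and writes the bracket as $(1 - 2|Q|\sin^2\theta_W)^2 + (2|Q|\sin^2\theta_W)^2$, which reduces to the electron, d-quark and u-quark cases for $|Q| = 1, 1/3, 2/3$. The input values of $G_F$, $M_Z$ and $\sin^2\theta_W$ and the function names are assumptions chosen for illustration, not values quoted in the text.

```python
import math

G_F = 1.1664e-5        # Fermi constant in GeV^-2 (assumed input value)
M_Z = 91.19            # Z boson mass in GeV (assumed input value)
SIN2_THETA_W = 0.231   # sin^2(theta_W) (assumed input value)


def gamma_nu():
    """Tree-level rate for Z -> nu nubar in GeV: G_F M_Z^3 / (12 pi sqrt 2)."""
    return G_F * M_Z ** 3 / (12.0 * math.pi * math.sqrt(2.0))


def gamma_fermion(abs_charge, colours=1):
    """Tree-level rate for Z -> f fbar in GeV for a fermion of charge magnitude |Q|.

    The bracket (1 - 2|Q| s^2)^2 + (2|Q| s^2)^2 encodes the substitution
    sin^2(theta_W) -> |Q| sin^2(theta_W) described in the answer to 15.1.
    """
    x = 2.0 * abs_charge * SIN2_THETA_W
    return colours * gamma_nu() * ((1.0 - x) ** 2 + x ** 2)


gamma_ee = gamma_fermion(1.0)                 # Z -> e+ e-
gamma_dd = gamma_fermion(1.0 / 3.0, colours=3)  # Z -> d dbar
gamma_uu = gamma_fermion(2.0 / 3.0, colours=3)  # Z -> u ubar

for name, value in [("nu nubar", gamma_nu()), ("e+ e-", gamma_ee),
                    ("d dbar", gamma_dd), ("u ubar", gamma_uu)]:
    print(f"Gamma(Z -> {name}) = {1000.0 * value:.1f} MeV")
```

These are lowest-order estimates only; no QCD or electroweak corrections are included.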


Chapter 16 

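16.1 $$G_{\mu\nu} = \partial_\mu G_\nu - \partial_\nu G_\mu + ig(G_\mu G_\nu - G_\nu G_\mu)$$

$$= (\partial_\mu G^a_\nu - \partial_\nu G^a_\mu)(\lambda_a/2) + (ig/4)\,G^b_\mu G^c_\nu(\lambda_b\lambda_c - \lambda_c\lambda_b),$$

and

$$\lambda_b\lambda_c - \lambda_c\lambda_b = 2i f_{bca}\lambda_a \quad \text{(see (B.27))}.$$

Hence

$$G_{\mu\nu} = [(\partial_\mu G^a_\nu - \partial_\nu G^a_\mu) - g f_{abc} G^b_\mu G^c_\nu](\lambda_a/2).$$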
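
The commutation relation $[\lambda_b, \lambda_c] = 2i f_{bca}\lambda_a$ used in 16.1 can be checked with a few lines of code. This is a minimal sketch under stated assumptions: it takes the standard Gell-Mann form of the $\lambda$ matrices and extracts the structure constants from $f_{abc} = \mathrm{Tr}([\lambda_a, \lambda_b]\lambda_c)/4i$, which follows from the normalisation $\mathrm{Tr}(\lambda_a\lambda_b) = 2\delta_{ab}$. The function names are illustrative.

```python
import math


def gell_mann():
    """Return the eight Gell-Mann matrices as nested lists (index 0 holds lambda_1)."""
    s3 = 1.0 / math.sqrt(3.0)
    return [
        [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
        [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
        [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
        [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
        [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
        [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
        [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
        [[s3, 0, 0], [0, s3, 0], [0, 0, -2 * s3]],
    ]


def matmul(x, y):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def trace(x):
    return sum(x[i][i] for i in range(3))


def structure_constant(lam, a, b, c):
    """f_abc = Tr([lambda_a, lambda_b] lambda_c) / (4i), with 1-based indices."""
    ab = matmul(lam[a - 1], lam[b - 1])
    ba = matmul(lam[b - 1], lam[a - 1])
    comm = [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]
    return (trace(matmul(comm, lam[c - 1])) / 4j).real


lam = gell_mann()
f_123 = structure_constant(lam, 1, 2, 3)
f_147 = structure_constant(lam, 1, 4, 7)
f_458 = structure_constant(lam, 4, 5, 8)
```

The antisymmetry $f_{cba} = -f_{abc}$ can be confirmed in the same way by permuting the arguments.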
16.2 These are the terms in (16.9) cubic and quadratic in the G fields. 
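16.3 Variation of $G^a_\nu$ gives

$$\delta S = \int \left[ -(1/2)\,G^{a\mu\nu}\,\delta G^a_{\mu\nu} - g\sum_f \bar{q}_f \gamma^\nu\, \delta G^a_\nu\, (\lambda_a/2)\, q_f \right] d^4x,$$

and

$$-(1/2)\,G^{a\mu\nu}\,\delta G^a_{\mu\nu} = -G^{a\mu\nu}\,\partial_\mu(\delta G^a_\nu) + g\,G^{c\mu\nu} G^b_\mu\, \delta G^a_\nu\, f_{cba}.$$

(There are two equal contributions to the right-hand side.) Integrating by parts gives

$$\delta S = \int \left[ \partial_\mu G^{a\mu\nu} - g\,G^{c\mu\nu} G^b_\mu f_{abc} - g\sum_f \bar{q}_f \gamma^\nu (\lambda_a/2)\, q_f \right] \delta G^a_\nu\, d^4x$$

$(f_{cba} = -f_{abc})$.

Since the $\delta G^a_\nu$ are arbitrary (16.14) is obtained.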

16.4 $Q^2/4m^2 = e^{12\pi^2/e^2} = e^{3\pi/\alpha} = 10^{560}$.
$2m \sim 1$ MeV, $Q^2 \sim 10^{560}\ (\mathrm{MeV})^2$.
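
The size of this exponent is easy to confirm. The sketch below assumes $\alpha = e^2/4\pi \approx 1/137.036$, so that $12\pi^2/e^2 = 3\pi/\alpha$, and converts the natural exponent to a power of ten; the variable names are illustrative.

```python
import math

ALPHA = 1.0 / 137.036  # fine-structure constant (assumed input value)

# e^2 = 4 pi alpha, so 12 pi^2 / e^2 should equal 3 pi / alpha.
e_squared = 4.0 * math.pi * ALPHA
natural_exponent = 12.0 * math.pi ** 2 / e_squared

# Convert exp(natural_exponent) to the form 10**power.
power_of_ten = natural_exponent / math.log(10.0)
print(f"Q^2 / 4m^2 is roughly 10^{power_of_ten:.1f}")
```

The computation works with the exponent only, because $e^{3\pi/\alpha}$ itself overflows a double-precision float.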


16.5 Take $\mathbf{Q}\cdot\mathbf{r} = Qr\cos\theta$ and $d^3\mathbf{Q} = Q^2\,dQ\,d(\cos\theta)\,d\phi$ where $(Q, \theta, \phi)$ are the polar
coordinates of $\mathbf{Q}$, with $\mathbf{r}$ taken to be $(0, 0, r)$.






From (14.15), the interaction terms in $udW^+$ and $usW^+$ contain factors $V_{ud}$ and
$V_{us}$, respectively. Problem (9.10) shows $\alpha_K/\alpha_\pi \approx 0.28$. Setting this equal to $V_{us}/V_{ud}$
gives $\sin\theta_{12} \approx 0.27$.
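
The last step can be made explicit. The sketch below assumes the standard parameterisation of the mixing matrix, in which $V_{us}/V_{ud} = \tan\theta_{12}$, and takes the ratio 0.28 from the answer above; the variable names are illustrative.

```python
import math

ratio = 0.28  # V_us / V_ud, the value quoted in the answer above

# Assumption: V_ud = c12 c13 and V_us = s12 c13, so the ratio is tan(theta_12).
theta_12 = math.atan(ratio)
sin_theta_12 = math.sin(theta_12)

print(f"theta_12 = {math.degrees(theta_12):.1f} degrees, sin(theta_12) = {sin_theta_12:.3f}")
```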
18.2 The internal wave function of two pions at $\mathbf{r}_1$ and $\mathbf{r}_2$ in an S state is a function of
only $|\mathbf{r}_1 - \mathbf{r}_2|$, and $|\mathbf{r}_1 - \mathbf{r}_2|$ is invariant under both C and P. Hence

$$CP\,|\pi^0\pi^0\rangle = |\pi^0\pi^0\rangle \quad \text{and} \quad CP\,|\pi^+\pi^-\rangle = |\pi^+\pi^-\rangle.$$

18.3 The internal wave function of three pions at $\mathbf{r}_1$, $\mathbf{r}_2$, $\mathbf{r}_3$ depends only on two relative
coordinates, say $\mathbf{r}_{12} = \mathbf{r}_2 - \mathbf{r}_1$ and $\mathbf{r}_{23} = \mathbf{r}_3 - \mathbf{r}_2$. To be invariant under rotations ($l = 0$)
the internal wave function can be a function of only three scalars: $\mathbf{r}_{12}\cdot\mathbf{r}_{12}$, $\mathbf{r}_{12}\cdot\mathbf{r}_{23}$,
and $\mathbf{r}_{23}\cdot\mathbf{r}_{23}$. These are invariant under C and P. Since the intrinsic parity of the $\pi^0$
is negative,

$$CP\,|\pi^0\pi^0\pi^0\rangle = -|\pi^0\pi^0\pi^0\rangle.$$

18.4 The area of the triangle formed by the origin and the points $\mathbf{r}_1 = (x_1, y_1, 0)$ and
$\mathbf{r}_2 = (x_2, y_2, 0)$ is

$$(1/2)|\mathbf{r}_1 \times \mathbf{r}_2| = (1/2)|x_1 y_2 - x_2 y_1| = (1/2)|\mathrm{Im}(z_1^* z_2)|,$$

where $z_1 = x_1 + iy_1$, $z_2 = x_2 + iy_2$. Hence the area of the unitary triangle is
$(1/2)|\mathrm{Im}(V^*_{ud} V_{ub} V_{cd} V^*_{cb})| = J/2$.
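
The equivalence of the two area formulas, and the fact that a common phase rotation of both complex numbers leaves the area unchanged, can be illustrated numerically. This is a minimal sketch; the function names and the sample points are assumptions chosen for illustration.

```python
import cmath


def area_from_cross_product(z1, z2):
    """Area (1/2)|x1 y2 - x2 y1| of the triangle with vertices 0, z1, z2."""
    return 0.5 * abs(z1.real * z2.imag - z2.real * z1.imag)


def area_from_complex_product(z1, z2):
    """Area (1/2)|Im(conj(z1) z2)| of the same triangle."""
    return 0.5 * abs((z1.conjugate() * z2).imag)


z1 = complex(3.0, 0.0)
z2 = complex(1.0, 4.0)

area_a = area_from_cross_product(z1, z2)
area_b = area_from_complex_product(z1, z2)

# Multiplying both numbers by a common phase rotates the triangle without changing its area.
phase = cmath.exp(1j * 0.8)
area_rotated = area_from_complex_product(phase * z1, phase * z2)
```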

18.5 All the complex numbers $z_i$ are transformed to $z_i' = e^{i(\theta_d - \theta_b)} z_i$ and the triangle is
rotated through an angle $(\theta_d - \theta_b)$.


Chapter 19 

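19.2 (a) $(U^*_{\beta j} U_{\alpha j} U_{\beta i} U^*_{\alpha i}) = (U^*_{\beta i} U_{\alpha i} U_{\beta j} U^*_{\alpha j})^*$, hence

$$\mathrm{Im}(U^*_{\beta j} U_{\alpha j} U_{\beta i} U^*_{\alpha i}) = -\mathrm{Im}(U^*_{\beta i} U_{\alpha i} U_{\beta j} U^*_{\alpha j}).$$

(b) Since U is unitary,

$$\sum_i F_{\beta\alpha ij} = \mathrm{Im}(\delta_{\alpha\beta} U_{\beta j} U^*_{\alpha j}) = \mathrm{Im}(|U_{\alpha j}|^2) = 0.$$

As two examples $F_{\beta\alpha 21} + F_{\beta\alpha 31} = 0$ and $F_{\beta\alpha 13} + F_{\beta\alpha 23} = 0$.
Hence $F_{\beta\alpha 12} = F_{\beta\alpha 23} = F_{\beta\alpha 31}$.

(c)

$$\sum_{i>j} F_{\beta\alpha ij} \sin\left(\frac{\Delta m^2_{ij} L}{2E}\right) = F_{\beta\alpha 21}\left[ \sin\left(\frac{\Delta m^2_{21} L}{2E}\right) + \sin\left(\frac{\Delta m^2_{32} L}{2E}\right) - \sin\left(\frac{(\Delta m^2_{21} + \Delta m^2_{32}) L}{2E}\right) \right]$$

and the result follows.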


Chapter 21 

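21.1 Let $(i\sigma^2\nu^*)^\dagger \sigma^\mu \partial_\mu (i\sigma^2\nu^*) = E$.
Inserting explicit spinor indices

$$E = \nu_i\, \sigma^2_{ij}\, \sigma^\mu_{jk}\, \sigma^2_{kl}\, \partial_\mu \nu^*_l \quad \text{(repeated indices summed)}.$$

But from the algebra of Pauli matrices $\sigma^2_{ij}\sigma^\mu_{jk}\sigma^2_{kl} = \tilde{\sigma}^\mu_{li}$. Taking account of the
anticommuting spinor fields $E = -\partial_\mu \nu^*_l\, \tilde{\sigma}^\mu_{li}\, \nu_i$, and discarding a total derivative that
makes no contribution to the action

$$E = \nu^*_l\, \tilde{\sigma}^\mu_{li}\, \partial_\mu \nu_i = \nu^\dagger \tilde{\sigma}^\mu \partial_\mu \nu.$$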

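21.2 Inserting explicit spinor indices

$$\nu_\alpha^{T}\sigma^2\nu_\beta = \nu_{\alpha i}\,\sigma^2_{ij}\,\nu_{\beta j} = -\nu_{\alpha i}\,\sigma^2_{ji}\,\nu_{\beta j} = \nu_{\beta j}\,\sigma^2_{ji}\,\nu_{\alpha i} = \nu_\beta^{T}\sigma^2\nu_\alpha.$$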

21.3 From (21.15) 

tjM jjM* _ TjD AAj ttD* -lAj _ jjD ijD* 

U Pi U ocj ~ l /:i C U aj e - u ftj U aj ■ 

Appendix A 

A.1 The equation holds for $\alpha\beta\ldots\nu = 1, 2, \ldots, n$. Interchanging, say, $\alpha$ and $\beta$ is equivalent
to interchanging column $i$ with column $j$, and gives the same sign change.

A.3 $M = (M + M^\dagger)/2 + i(M - M^\dagger)/2i$. $A = (M + M^\dagger)/2$ is Hermitian, as is $B = (M - M^\dagger)/2i$.
A and B, and hence M, can be diagonalised by the same transformation if and only if

$AB - BA = 0$, i.e. $(M + M^\dagger)(M - M^\dagger) - (M - M^\dagger)(M + M^\dagger) = 0$

or

$$MM^\dagger - M^\dagger M = 0.$$

(This condition is satisfied if M is unitary.)

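A.4 Since $(MM^\dagger)^\dagger = MM^\dagger$, we can find $U_1$ such that $U_1(MM^\dagger)U_1^\dagger = M_D^2$. $M_D^2$ has
diagonal elements $\geq 0$, since $M_D^2 = U_1 M (U_1 M)^\dagger$. Thus we can choose $M_D$ with
real diagonal elements $\geq 0$. If none are zero, $M_D$ can be inverted. We may then define

$$H = U_1^\dagger M_D U_1 = H^\dagger, \quad \text{and} \quad V = H^{-1}M.$$

Hence

$$VV^\dagger = H^{-1} M M^\dagger H^{-1} \quad \text{since } (H^{-1})^\dagger = H^{-1}$$
$$= H^{-1} U_1^\dagger M_D^2 U_1 H^{-1}$$
$$= U_1^\dagger M_D^{-1} U_1 U_1^\dagger M_D^2 U_1 U_1^\dagger M_D^{-1} U_1$$
$$= I, \quad \text{since } U_1 U_1^\dagger = I.$$

Thus V is unitary, as is $U_1 V = U_2$.

Finally, $M = HV = U_1^\dagger M_D U_1 V = U_1^\dagger M_D U_2$.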

Appendix B 

B.1 A unitary transformation, $H \to H' = VHV^\dagger = H_D$, say, also diagonalises each term
of U and hence

$$U \to U' = VUV^\dagger = U_D = \exp(iH_D).$$

$$\det U = \det U_D = \prod_n \exp[i(H_D)_{nn}] = \exp\Big[i\sum_n (H_D)_{nn}\Big] = \exp[i\,\mathrm{Tr}\,H_D].$$

But $\mathrm{Tr}\,H_D = \mathrm{Tr}\,H$. Hence if $\mathrm{Tr}\,H = 0$, $\det U = 1$.

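B.2 The SU(2) matrices corresponding to $R_{01}(\theta)$ and $R_{02}(\theta)$ are respectively

$$\begin{pmatrix} \cos(\theta/2) & i\sin(\theta/2) \\ i\sin(\theta/2) & \cos(\theta/2) \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} \cos(\theta/2) & \sin(\theta/2) \\ -\sin(\theta/2) & \cos(\theta/2) \end{pmatrix}$$

and the correspondence can be checked directly.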
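
One way to carry out the direct check is to map each SU(2) matrix to a 3 × 3 matrix through the trace relation $R^k_{\ i} = \tfrac{1}{2}\mathrm{Tr}(\sigma^k U \sigma^i U^\dagger)$ (the relation that the answer to Problem B.5 arrives at) and confirm that the result is a rotation about the expected axis. The sketch below is illustrative only: the helper names are assumptions, and because the sign convention for the sense of rotation is not fixed here, only the invariant axis and the magnitudes of the off-diagonal entries are examined.

```python
import math

# Pauli matrices sigma^1, sigma^2, sigma^3.
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(x):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[x[j][i].conjugate() for j in range(2)] for i in range(2)]


def trace(x):
    return x[0][0] + x[1][1]


def rotation_from_su2(U):
    """Return the 3x3 matrix with entries (1/2) Tr(sigma^k U sigma^i U^dagger)."""
    Ud = dagger(U)
    return [[0.5 * trace(matmul(matmul(SIGMA[k], U), matmul(SIGMA[i], Ud))).real
             for i in range(3)] for k in range(3)]


theta = 0.9  # sample angle in radians (assumed value)
c, s = math.cos(theta / 2.0), math.sin(theta / 2.0)

U1 = [[c, 1j * s], [1j * s, c]]   # first matrix of the answer
U2 = [[c, s], [-s, c]]            # second matrix of the answer

R1 = rotation_from_su2(U1)
R2 = rotation_from_su2(U2)
```

For `U1` the first axis is left unchanged and the remaining 2 × 2 block contains $\cos\theta$ and $\pm\sin\theta$; for `U2` the same holds with the second axis fixed.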


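B.3 From equation (B.5), using (B.12) and Problem B.2, $R(\psi, \theta, \phi)$ corresponds to the
product

$$\begin{pmatrix} e^{i\psi/2} & 0 \\ 0 & e^{-i\psi/2} \end{pmatrix} \begin{pmatrix} \cos(\theta/2) & \sin(\theta/2) \\ -\sin(\theta/2) & \cos(\theta/2) \end{pmatrix} \begin{pmatrix} e^{i\phi/2} & 0 \\ 0 & e^{-i\phi/2} \end{pmatrix}.$$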

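B.4 Under a Lorentz transformation, $l \to l' = Ml$, $r \to r' = Nr$.
Hence

$$l^\dagger \tilde{\sigma}^\mu \sigma^\nu r \to l^\dagger M^\dagger \tilde{\sigma}^\mu \sigma^\nu N r$$
$$= l^\dagger M^\dagger \tilde{\sigma}^\mu M N^\dagger \sigma^\nu N r \quad \text{since } MN^\dagger = I$$
$$= l^\dagger L^\mu_{\ \lambda} \tilde{\sigma}^\lambda L^\nu_{\ \rho} \sigma^\rho r \quad \text{from (B.17) and (B.18)}$$
$$= L^\mu_{\ \lambda} L^\nu_{\ \rho} (l^\dagger \tilde{\sigma}^\lambda \sigma^\rho r).$$

It is easy to verify that

$$\tilde{\sigma}^\mu \sigma^\nu + \tilde{\sigma}^\nu \sigma^\mu = \begin{cases} 0 & \text{if } \mu \neq \nu, \\ 2 & \text{if } \mu = \nu = 0, \\ -2 & \text{if } \mu = \nu = i,\ i = 1, 2, 3. \end{cases}$$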


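B.5 Equation (B.10) gives

$$X(x) = x^i \sigma^i,$$

$$X'(x') = x'^i \sigma^i = R^i_{\ j} x^j \sigma^i.$$

Also $X' = UXU^\dagger = U x^j \sigma^j U^\dagger$. The $x^j$ are arbitrary. Hence $U\sigma^j U^\dagger = R^i_{\ j}\sigma^i$. Multiplying
on the left by $\sigma^k$ and taking the trace,

$$\mathrm{Tr}(\sigma^k U \sigma^j U^\dagger) = R^i_{\ j}\,\mathrm{Tr}(\sigma^k \sigma^i).$$

Now

$$\mathrm{Tr}(\sigma^k \sigma^i) = \begin{cases} 2 & \text{if } k = i, \\ 0 & \text{if } k \neq i. \end{cases}$$

Hence the result.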
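B.6 From (B.17), $M^\dagger \tilde{\sigma}^\mu M = L^\mu_{\ \lambda} \tilde{\sigma}^\lambda$. Multiplying on the left by $\sigma_\nu$ and taking the trace,
the result follows, since

$$\mathrm{Tr}(\sigma_\nu \tilde{\sigma}^\lambda) = \begin{cases} 2 & \text{if } \lambda = \nu, \\ 0 & \text{if } \lambda \neq \nu. \end{cases}$$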


Appendix C 

C.2 The ground state is given by $a|0\rangle = 0$, or $(X + iP)|0\rangle = 0$. In the Schrödinger representation,
$P = -i\,d/dX$, so that $(X + d/dX)\psi_0 = 0$, giving $\psi_0 = Ae^{-X^2/2}$, where
the constant A is determined by normalisation.

C.3 $N_i b_i^\dagger|0\rangle = b_i^\dagger b_i b_i^\dagger|0\rangle$

$= b_i^\dagger(1 - b_i^\dagger b_i)|0\rangle = b_i^\dagger|0\rangle$.


Appendix D 

D.1 $$Q^2 = (\mathbf{p} - \mathbf{p}')^2 - (E - E')^2$$

$$= (p^2 - E^2) + (p'^2 - E'^2) - 2\mathbf{p}\cdot\mathbf{p}' + 2EE'.$$

But $E^2 = p^2 + m^2$, $E'^2 = p'^2 + m^2$, so that, neglecting electron masses,

$$Q^2 = -2pp'\cos\theta + 2EE' = 2EE'(1 - \cos\theta) = 4EE'\sin^2(\theta/2).$$

The energy and momentum of the recoil proton are given by $E_p = M + E - E'$,
$\mathbf{P} = \mathbf{p} - \mathbf{p}'$; also $E_p^2 = M^2 + P^2$. Hence

$$Q^2 = P^2 - (E - E')^2$$
$$= (M + E - E')^2 - M^2 - (E - E')^2$$
$$= 2M(E - E')$$

so that (D.3) follows.
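
The two expressions for $Q^2$ can be cross-checked numerically. Equating $4EE'\sin^2(\theta/2)$ with $2M(E - E')$ and solving for $E'$ gives $E' = E/[1 + (2E/M)\sin^2(\theta/2)]$; the sketch below uses that solution with an assumed beam energy, scattering angle and proton mass (illustrative values, not taken from the text), neglecting the electron mass as in the answer.

```python
import math


def scattered_energy(E, theta, M):
    """E' obtained by equating 4 E E' sin^2(theta/2) with 2 M (E - E')."""
    return E / (1.0 + (2.0 * E / M) * math.sin(theta / 2.0) ** 2)


E = 1.0                      # incident electron energy in GeV (assumed value)
M = 0.938                    # proton mass in GeV (assumed value)
theta = math.radians(60.0)   # scattering angle (assumed value)

E_prime = scattered_energy(E, theta, M)
q2_from_angle = 4.0 * E * E_prime * math.sin(theta / 2.0) ** 2
q2_from_recoil = 2.0 * M * (E - E_prime)

print(f"E' = {E_prime:.4f} GeV, Q^2 = {q2_from_angle:.4f} GeV^2")
```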

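D.3 $$Q^2 = 2EE'(1 - \cos\theta), \qquad \nu = E - E',$$

$$dQ^2\,d\nu = \left| \frac{\partial(Q^2, \nu)}{\partial(\cos\theta, E')} \right| d(\cos\theta)\,dE',$$

where the Jacobian of the transformation is

$$\begin{vmatrix} -2EE' & 2E(1 - \cos\theta) \\ 0 & -1 \end{vmatrix} = 2EE'.$$

Hence the result.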